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Streptococcus pneumoniae Antigens and Vaccines 

Field of the Invention 

The present invention relates to novel Streptococcus pneumoniae 
antigens for the detection of Streptococcus and for the prevention or attenuation 
of disease caused by Streptococcus. The invention further relates to isolated 
nucleic acid molecules encoding antigenic polypeptides of pneumoniae. 
Antigenic polypeptides are also provided, as are vectors, host cells and 
recombinant methods for producing the same. The invention additionally relates 
to diagnostic methods for detecting Streptococcus gene expression. 

Background of the Invention 

Streptococcus pneumoniae has been one of the most extensively studied 
microorganisms since its first isolation in 1881. It was the object of many 
investigations that led to important scientific discoveries. In 1928, Griffith 
observed that^when heat-killed encapsulated pneumococci and live strains 
constitutively lacking any capsule were concomitantly injected into mice, the 
nonencapsulated could be converted into encapsulated pneumococci with the 
same capsular type as the heat-killed strain. Years later, the nature of this 
transforming principle," or carrier of genetic information, was shown to be 
DNA. (Avery, O.T., et al ., J. Exp. Med., 79:137-157 (1944)). 

In spite of the vast number of publications on 5. pneumoniae many 
questions about its virulence are still unanswered, and this pathogen remains a 
major causative agent of serious human disease, especially community-acquired 
pneumonia. (Johnston, R.B., et al.. Rev. Infect. Dis. 7J(Suppl. 6):S509-517 
(1991)). In addition, in developing countries, the pneumococcus is responsible 
for the death of a large number of children under the age of 5 years from 
pneumococcal pneumonia. The incidence of pneumococcal, disease is highest in 
infants under 2 years of age and in people over 60 years of age. Pneumococci 
are the second most frequent cause (after Haemophilus influenzae type b) of 
bacterial meningitis and otitis media in children. With the recent introduction of 
conjugate vaccines for H. influenzae type b, pneumococcal meningitis is likely 
to become increasingly prominent. S. pneumoniae is the most important 
etiologic agent of community-acquired pneumonia in adults and is the second 
most common cause of bacterial meningitis behind Neisseria meningitidis. 

The antibiotic generally prescribed to treat S. pneumoniae is 
benzylpenicillin, although resistance to this and to other antibiotics is found 
occasionally. Pneumococcal resistance to penicillin results from mutations in its 
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penicillin-binding proteins. In uncomplicated pneumococcal pneumonia caused 
by a sensitive strain, treatment with penicillin is usually successful unless 
started too late. Erythromycin or clindamycin can be used to treat pneumonia in 
patients hypersensitive to penicillin, but resistant strains to these drugs exist. 
Broad spectrum antibiotics (e.g., the tetracyclines) may also be effective, 
although tetracycline-resistant strains are not rare. In spite of the availability of 
antibiotics, the mortality of pneumococcal bacteremia in the last four decades 
has remained stable between 25 and 29%. (Gillespie, S.H., et al., J. Med. 
Microbiol. 28: 237-248 (1989). 

S. pneumoniae is carried in the upper respiratory tract by many healthy 
individuals. It has been suggested that attachment of pneumococci is mediated 
by a disaccharide receptor on fibronectin, present on human pharyngeal 
epithelial cells. (Anderson, B.J., et al., J. Immunol. 742:2464-2468 (1989). 
The mechanisms by which pneumococci translocate from the nasopharynx to 
the lung, thereby causing pneumonia, or migrate to the blood, giving rise to 
bacteremia or septicemia, are poorly understood. (Johnston, R.B., et al.. Rev. 
Infect. Dis. 73(Suppl. 6):S509-517 (1991). 

Various proteins have been suggested to be involved in the pathogenicity 
of S. pneumoniae, however, only a few of them have actually been confirmed 
as virulence factors. Pneumococci produce an IgAl protease that might 
interfere with host defense at mucosal surfaces. (Komfield, S.J., et al.. Rev. 
Inf. Dis. 3: 521-534 (1981). S. pneumoniae also produces neuraminidase, an 
enzyme that may facilitate attachment to epithelial cells by cleaving sialic acid 
from the host glycolipids and gangliosides. Partially purified neuraminidase 
was observed to induce meningitis-like symptoms in mice; however, the 
reliability of this finding has been questioned because the neuraminidase 
preparations used were probably contaminated with cell wall products. Other 
pneumococcal proteins besides neuraminidase are involved in the adhesion of 
pneumococci to epithelial and endothelial cells. These pneumococcal proteins 
have as yet not been identified. Recently, Cundell et al. , reported that peptide 
permeases can modulate pneumococcal adherence to epithelial and endothelial 
cells. It was, however, unclear whether these permeases function directly as 
adhesions or whether they enhance adherence by modulating the expression of 
pneumococcal adhesions. (DeVelasco, E.A., et al.. Micro. Rev. 59. 591-603 
(1995). A better understanding of the virulence factors determining its 
pathogenicity will need to be developed to cope with the devastating effects of 
pneumococcal disease in humans. 
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vectors and hos, cells and fo, using rhese vectors for the production of S. 
pneumoniae polypeptides or peptides by recombinant techniques. 

The invention further provides isolated 5. pneumomae polypeptrdes 
having an amino acid sequence selected from the group cons.st.ng of an anuno 
acid sequence of any of the polypeptides described in Table 1 

The polypeptides of the present invention also include polypepti e 
havin'* an amino acid sequence with a. leas, 70% similarity, and more preferably 
at tet 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% smulanty to 
those described in Table 1, as well as polypeptides having an anuno 
sequence at least 70% identical, more preferably at least 75% identical, and 
more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% idenucal o 
those above; as well as isolated nucleic acid molecules encoding 

polypepudes. ^ provides a vaccine, preferably a 

mult, -component vaccine comprising one or more of the 
polynucleotides, or polypeptides dereribed in Table 1, or fragments thereof 
together with a phamuceudcally acceptable diluent, earner, « 
wherein the S. pneumoniae polypeptide(s) are present in an amount effective to 
elicit an immune .espouse ,0 members of die Sirepiococcus genus ,n 
The S. pneumoniae polypeptides of the present invention may 
combined with one or more immunogens of one or more other 
non-streptococcal organisms to produce a multi-component vaccine intend 
elicit an immunological response against members of die Sirepiococcus genus 

and, optionally, one or more non-streptococcal organ, sms. 

The vaccines of the present invention can be administered in DNA 
form e g "naked- DNA, wherein the DNA encodes one or more streptococc 
polypepddes and, optionally, one or more polypeptides of a „on-suept~occtd 
organism. The DNA encoding one or more polypeptides may be consmicted 

such that these polypeptides are expressed fusion proteins. 

The vaccines of the present invention may also be admin 
component of a genetically engineered organism. Thus, a 8™““ y 
engineered organism which expresses one or more S. pneumomae 
Z be adnuiustered to an animal. For example, such a genehcally engrneered 
organism may contain one or more 5. pneumoniae polypeptides off he pes^ 

invention intracellularly, on its cell surface, or in its penplasmic sp 

sTh ^genetically engineered organism may secrete one or more 5. f— 
polypeptides. 
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The vaccines of the present invention may be co-administered to an 
animal with an immune system modulator ( e.g ., CD86 and GM-CSF). 

The invention also provides a method of inducing an immunological 
response in an animal to one or more members of the Streptococcus genus, 
preferably one or more isolates of the 5. pneumoniae genus, comprising 
administering to the animal a vaccine as described above. 

The invention further provides a method of inducing a protective 
immune response m an animal, sufficient to prevent or attenuate an infection by 
members of the Streptococcus genus, preferably at least 5. pneumoniae, 
comprising administering to the animal a composition comprising one or more, 
of the polynucleotides or polypeptides described in Table 1, or fragments 
thereof. Furher, these polypeptides, or fragments thereof, may be conjugated 
to another immunogen and/or administered in admixture with an adjuvant. ° 

The invention further relates to antibodies elicited in an animal by the 
administration of one or more 5. pneumoniae polypeptides of the present 
invention and t p methods for producing such antibodies. 

The invention also provides diagnostic methods for detecting the 
expression of genes of members of the Streptococcus genus in an animal ° One 
such method involves assaying for the expression of a gene encoding S. 
pneumoniae peptides in a sample from an animal. This expression may be 
assayed either directly (e.g., by assaying polypeptide levels using antibodies 
elicited in response to amino acid sequences described in Table 1) or indirectly 
(e.g., by assaying for antibodies having specificity for amino acid sequences 
described in Table 1). An example of such a method involves the use of the 

polymerase chain reaction (PCR) to amplify and detect Streptococcus nucleic 
acid sequences. 

The present invention also relates to nucleic acid probes having all or 
part of a nucleotide sequence described in Table 1 (shown as.SEQ ID NO:l, 
SEQ ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225) which are 
capable of hybridizing under stringent conditions to Streptococcus nucleic acids. 
The invention further relates to a method of detecting one or more Streptococcus 
nucleic acids in a biological sample obtained from an animal, said one or more 
nucleic acids encoding Streptococcus polypeptides, comprising: (a) contacting 
the sample with one or more of the above-described nucleic acid probes, under 
conditions such that hybridization occurs, and (b) detecting hybridization of said 

one or more probes to the Streptococcus nucleic acid present in the biological 
sample. 
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The invention also includes immunoassays, including an immunoassay 
for detecting Streptococcus , preferrably at least isolates of the S. pneumoniae 
genus, comprising incubation of a sample (which is suspected of being infected 
with Streptococcus ) with a probe antibody directed against an antigen/epitope of 
S. pneumoniae, to be detected under conditions allowing the formation of an 
antigen-antibody complex; and detecting the antigen-antibody complex which 
contains the probe antibody. An immunoassay for the detection of antibodies 
which are directed against a Streptococcus antigen comprising the incubation of 
a sample (containing antibodies from a mammal suspected of being infected 
with Streptococcus) with a probe polypeptide including an epitope of S. 
pneumoniae , under conditions that allow the formation of antigen-antibody 
complexes which contain the probe epitope containing antigen. 

Some aspects of the invention pertaining to kits are those for: 
investigating samples for the presence of polynucleotides derived from 
Streptococcus which comprise a polynucleotide probe including a nucleotide 
sequence selected from Table 1 or a fragment thereof of approximately 15 or 
more nucleotides, in an appropriate container; analyzing the samples for the 
presence of antibodies directed against a Streptococcus antigen made up of a 
polypeptide which contains a S. pneumoniae epitope present in the polypeptide, 
in a suitable container; and analyzing samples for the presence of Streptococcus 
antigens made up of an anti-S. pneumoniae antibody , in a suitable container. 

Detailed Description 

The present invention relates to recombinant antigenic S. pneumoniae 
polypeptides and fragments thereof. The invention also relates to methods for 
using these polypeptides to produce immunological responses and to confer 
immunological protection to disease caused by members of the genus 
Streptococcus, at least isolates of the S. pneumoniae genus. The invention 
further relates to nucleic acid sequences which encode antigenic S. pneumoniae 
polypeptides and to methods for detecting S. pneumoniae nucleic acids and 
polypeptides in biological samples. The invention also relates to 5. 
pneumoniae- specific antibodies and methods for detecting such antibodies 
produced in a host animal. 

Definitions 

The following definitions are provided to clarify the subject matter 
which the inventors consider to be the present invention. 
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As used herein, the phrase "pathogenic agent" means an agent which 
causes a disease state or affliction in an animal. Included within this definition, 
for examples, are bacteria, protozoans, fungi, viruses and metazoan parasites 
which either produce a disease state or render an animal infected with such an 
organism susceptible to a disease state (e.g., a secondary infection). Further 

included are species and strains of the genus Streptococcus which produce 
disease states in animals. 

As used herein, the term "organism" means any living biological 
system, including viruses, regardless of whether it is a pathogenic agent. 

As used herein, the term " Streptococcus " means any species or strain of 
bacteria which is members of the genus Streptococcus. Such species and 
strains are known to those of skill in the art, and include those that are 
pathogenic and those that are not. 

As used herein, the phrase ‘one or more S. pneumoniae polypeptides of 
the present invention" means polypeptides comprising the amino acid sequence 
of one or mor^ of the S. pneumoniae polypeptides described in Table 1 and 
disclosed as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and so on through 
SEQ ID NO: 226. These polypeptides may be expressed as fusion proteins 
wherein the S. pneumoniae polypeptides of the present invention are linked to 
additional ammo acid sequences which may be of streptococcal or non- 
streptococcal origin. This phrase further includes polypeptide comprising 
fragments of the S. pneumoniae polypeptides of the present invention. 

Additional definitions are provided throughout the specification. 

Explanation of Table 1 

Table 1, below, provides information describing 113 open reading 
frames (ORFs) which encode potentially antigenic polypeptides of S° 
pneumoniae of the present invention. The table lists the ORF identifier which 
consists of the letters SP, which denote 5. pneumoniae, followed immediately 
by a three digit numeric code, which arbitrarily number the potentially antigenic 
polypeptides of S. pneumoniae of the present invention and the nucleotide or 
ammo acid sequence of each ORF and encoded polypeptide. The table further 
correlates the ORF identifier with a sequence identification number (SEQ ID 
NO:). The actual nucleotide or amino acid sequence of each ORF identifier is 
also shown in the Sequence Listing under the corresponding SEQ ID NO. 

Thus, for example, the designation "SP126" refers to both the 
nucleotide and amino acid sequences of S. pneumoniae polypeptide number 126 
of the present invention. Further, "SP126" correlates with the nucleotide 
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sequence shown as SEQ ID NO:223 and with the amino acid sequence shown 
as SEQ ID NO:224 as is described in Table 1. 

The open reading frame within each "ORF" begins with the second 
nucleotide shown. Thus, the first codon for each nucleotide sequence shown is 
bases 2-4, the second 5-7, the third 8-10, and so on. 

Explanation of Table 2 

Table 2 lists the antigenic epitopes present in each of the S. pneumoniae 
polypeptides described in Table 1 as predicted by the inventors. Each S. 
pneumoniae polypeptide shown in Table 1 has one or more antigenic epitopes 
described in Table 2. It will be appreciated that depending on the analytical 
criteria used to predict antigenic determinants, the exact address of the 
determinant may vary slightly. The exact location of the antigenic determinant 
may shift by about 1 to 5 residues, more likely 1 to 2 residues, depending on 
the criteria used. Thus, the first antigenic determinant described in Table 2, 
M Lys-l to lie- 10” of SP001, represents a peptide comprising the lysine at 
position 1 in SEQ ID NO:2 through and including the isoleucine at position 10 
in SEQ ID NO:2, but may include more or fewer residues than those 10. It will 
also be appreciated that, generally speaking, amino acids can be added to either 
terminus of a peptide or polypeptide containing an antigenic epitope without 
affecting its activity, whereas removing residues from a peptide or polypeptide 
containing only the antigenic determinant is much more likely to destroy 
activity. It will be appreciated that the residues and locations shown described 
in Table 2 correspond to the amino acid sequences for each ORF shown in 
Table 1 and in the Sequence Listing. 

Explanation of Table 3 

Table 3 shows PCR primers designed by the inventors for the 
amplification of polynucleotides encoding polypeptides of the present invention 
according to the method of Example 1. PCR primer design is routine in the art 
and those shown in Table 3 are provided merely for the convenience of the 
skilled artisan. It will be appreciated that others can be used with equal success. 

For each primer, the table lists the corresponding ORF designation from 
Table 1 followed by either an "A" or a "B". The "A" primers are the 5' primers 
and the ”B" primers 3'. A restriction enzyme site was built into each primer to 
allow ease of cloning. The restriction enzyme which will recognize and cleave a 
sequence within each primer is shown in Table 3, as well, under the heading 
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"RE" for restriction enzyme. FtnaJJy the sequence identifier is shown in Table 3 
' eac pnmer for eas >' correlation with the Sequence Listing 

Selection of Nucleic Acid Sequences Encoding Antigenic S 
pneumoniae Polypeptides 8 ' 

presented"! „T7 inVenIi ° D Pr ° Vid “ 2 " umber of <** ^om those 
primed „ the fragments of the 5. pneumoniae genome which may prove 

e generation of a protective immune response. The sequenced 5 

pnenmontae genomic DNA was obtamed from a suborned isolate of s' 

Ty^cZe cT T U ' 8 ‘ 9 '' 1,25 bee " deposi,ed “ ,hc A«*ncan 

ype Culture Collect, on, as a convenience to those of skill in the art The S 
pneuntontae isolate was deposited on October 10, ,996 a, the ATCC P 3 o, 
Pari^aum Drive, Rockville, Maraud 20853. and given access, on nu^! 

genomic library constructed from DNA isolated from the S 

Z~T V V' S ° deP ° S,Kd ^ ATCC °" “• «99« »d 

ATCC IJepost, No. 97755. A more complete listing of the sequence 
o tame rom the S. pneumoniae genome may be found in co-pendim* U s 

h — ‘ APP ' ,Ca “° n ^ N °' « V02 9-9“. 10/31/96, incorporated 

fragments 7ZT ‘ nttKCy ' S ° “ ° RFS C ° Mained ,n ,he s “ b “‘ of 
.. *77 °f the pneumoniae genome d,sclosed herein were derived through 

se of a number of screening criteria detailed below. 

The selected ORFs do not consist of complete ORFs. Although a 
polypepttde representing a complete ORF may be the closes, approximate ’of a 

OWm "T! V ° rSaniSm ' " ^ a ' WiyS “ eX P ress * complete 

ORF a heterologous system. I, may be challenging to express and purify a 

h,ghly hydrophobic protein by common laboratory methods. Thus L 

polypepude vaccine candidate described herein may have been modified 

tghtly to simplify the producrion of recombinant protein.- For example 

nucleo„de sequences which encode highly hydmphobic domains, such as those’ 

found a. the ajrnno terminal signal sequence, have been excluded from !o™ 

7 “ VUro expression of the polypeptides. Furthennore, any 
highly hydrophobtc amino acid sequences occutring a, the carboxy teiminus 

.o eenexcluded from, he recombinant expression consmuc. Thus in 

embodiment, a polypepride which represent a truncated or modified ORF 

may be used as an antigen. 

While numerous methods are known in me art for selecting potentially 
immunogenic polypeptides, many of the ORFs disclosed herein were selected 
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on the basts of screening all theorettcal 5. pneumoniae ORFs for several aspects 
of potential immunogenic ity. One set of selection criteria are as follows. 

1 Type I signal sequence: An amino terminal type I signal sequence 
generally directs a nascent protein across the plasma and outer membranes to the 
exterior of the bacterial cell. Experimental evidence obtained from studies with 
Escherichia coli suggests that the typical type I signal sequence consists of the 
following biochemical and physical attributes (Izard, J. W. and Kendall, D. . 
Mol Microbiol. 13:765-773 (1994)). The length of the type I signal sequence 
is approximately 15 to 25 primarily hydrophobic amino acid residues with a net 
positive charge in the extreme amino terminus. In addition, the central region o 
the signal sequence adopts an alpha-helical conformation in a hydrophobic 
environment. Finally, the region surrounding the actual site of cleavage is 
ideally six residues long, with small side-chain amino acids m the -1 and -3 

positions. . , 

2. Type IV signal sequence: The type IV signal sequence is an example 

of the several types of functional signal sequences which exist in addition to the 
type 1 signal sequence detailed above. Although functionally related, the type 
IV signal sequence possesses a unique set of biochemical and physical attributes 
(Strom, M. S. and Lory. S„ J. ***** 174:7345-735 1 ( 1992)). These are 
typically six to eight amino acids with a net basic charge followed by an 
additional sixteen to thirty primarily hydrophobic residues. The cleavage sue of 
a type IV signal sequence is typically after the initial six to eight amino acids at 
the extreme amino terminus. In addition, type W signal sequences generally 
contain a phenylalanine residue at the +1 site relative to the cleavage site. 

3 Lipoprotein: Studies of the cleavage sites of twenty-six bactenal 
lipoprotein precursors has allowed the definition of a consensus ammo acid 
sequence for lipopiotein cleavage. Nearly three-fourrhs of the ^bactemd 
lipoprotein precursors examined contained the sequence L-(A.S)-(G,A -t- at 
positions -3 to +1, relative to the point of cleavage (Hayash., S. and Wu, H. 

C., J. Bioenerg. Biomembr. 22:451-471 (1990)). 

4. LPXTG motif: It has been experimentally determined that most 

anchored proteins found on the surface of gram-positive bacteria possess a 
hiohly conserved carboxy terminal sequence. Mom than fifty such proteins 
from organisms such as 5. pyogenes, S. ntutans. E. faecalis, S. pnetononute 
and others, have been identified based on their extracellular location .and 
carboxy terminal amino acid sequence (Fischetti, V. A., ews 

62-405-410(1996)). The conserved region consists of six charged amino aci s 

at 'the extreme cmboxy tetminus coupled to 15-20 hydrophobic amino acids 
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presumed to function as a tmnsmembrane domain. Immediately adjacent to tfte 
transmembrane domatn ,s a six * actd sequence conserved * 

proteins exanuned. The amino acid sequence of this region is L-P-X-T-G X 
where X is any amino acid. U X ' 

An algonthm for selecting antigenic and immunogenic 5 
polypeptides including tbe foregoing critena seas deveCd ZZZ 
algonthm by the inventors to select immunologically useful S 2 
pojypeptides resulted in the selecdon of a number If ,^1 ~T 

yp p ides comprising the polypeptides identified in this „ r ouo mav he 
produced by techniques standard the an and as funher described hem” 

Nucleic Acid Molecules 

The present invention provides isolated nucleic acid molecules 
compos, ng polynucleotides encoding the * pneumoniae 

smTIiI “ qUenCeS <teSCribed Table 1 and shown “ SEQ ID NO:2 

d t H n / Q ID NO:6 ’ a " d S ° °" Ihr0Ugh SEQ 113 no ' 226, which were 
determined by sequencing the genome of 5. pmumoniae Md J ^ 

putative immunogens. Elected as 

Unless otherwise indicated, all nucleotide sequences detennined bv 
equenctng a DNA molecule herein were detennined using an automated DnI 
sequencer (such as the Mtal 373 from Applied Biosystems. ,„c ) ^ 
amtno acd sequences of polypeptides encoded by DNA molecules dt,el„* 

^fore^ ^ ^ ° f ° NA ~ S 

Therefore, as ts known m the an for any DNA sequence determined by this 

automated approach, any nucleotide sequence determmed hereto may contain 

” “"*■ NUde0tide tletermined by automata are typilTl 

east taut *. identical, more typify . leas, about 95% IO at 

lecule. The acmal sequence can be more precisely determined by other 
W^hes including manual DNA sequencing methods well known in the art 

ntclt°se W " “ Singfc inSeni ° n ° r ” ■ ««»tad 

nucleotide sequence compared to the actual sequence will cause a frame shift in 

sequence” ” dd " UC ' e ° Ude Seq “ nce such that lhe predicted amino acid 
sequence encoded by a detennined nucleotide sequence will be completely 

different from the amino acid sequence actually encoded by the sequenced DNA 

molecule, beginning a. the pom, of such an insenion or deletion 

Unless otherwise indicated, each "nucleotide sequence" set fonh herein 

is presented as a sequence of deoxyribonucleotides (abbreviated A, G , C and 
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T1 However, by "nucleotide sequence" of a nuclerc ac.d molecule or 
polynucleotide is intended, for a DNA molecule or polynucleotide, a sequence 
of deoxyribonucleotides, and for an RNA molecule or polynucleotide, the 
corresponding sequence of ribonucleotides (A, G, C and U), where e 
thymidine deoxyribonucleoiide (T) tn tire specified deoxynbonucleotide 
sequence is replaced by tire ribonucleotide uridine (U). For instance reference 
to an RNA molecule having a sequence described in Table 1 set forth usrng 
deoxynbonucleotide abbreviations is intended to indicate an RNA molecule 
having a sequence in which each deoxyribonucleot.de A, G or C descnbe in 
Table 1 has been replaced by the corresponding ribonucleotide A, or , an 
each deoxynbonucleotide T has been replaced by a ribonucleotide U 

Nucleic acid molecules of the present invention may be rn the form o 
RNA. such as mRNA, or in the form of DNA, including for tnstance, cDNA 
and genomic DNA obtarned by cloning or produced synthetically. The DNA 
may be double-stranded or single-stranded. Single-stranded DNA or RNA may 
be the coding, strand, also known as the sense strand, or may be the 

non-coding strand, also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nuclerc acrd 
molecule, DNA or RNA, which has been removed from its native envrronmenti 
FO, example, recombinant DNA molecules contained in a vector are consrdered 
isolated for the purposes of the present invention. Further examples of rsolated 
DNA molecules include recombinant DNA molecules marntarned u, 
heterologous host cells or purified (partially or substantially) DNA molecules nr 
solution. Isolated RNA molecules include in vivo or inM.ro RNA trrunrcnpts of 
tire DNA molecules of the present invention. Isolated nuclerc acrd molecules 
according to the preren. invention further include such molecules produced 

synthetically^ ^ mo lecules of the present invention include DNA 

molecules comprising a nucleotide sequence described in Table 1 “d showr, as 
SEO ID NOtl, SEQ ID NO:3. SEQ ID NO:5, and so on through SEQ ID 
NO:225; DNA molecules comprising the coding sequences f “ P ° ly ^ ) 
described in Table 1 and shown as SEQ ID NOt2, SEQ ID NOA SEQ JD 
NO-6 and so on through SEQ ID NO:226; and DNA molecules whrch compn 
sequences substantially different from those described above bu, whrch, due o 
the degeneracy of tire genetic code, still encode the S. P—es 

described ,» Table 1. Of course, the genetic code is well known m ^ 
Thus, it would be routine for one skilled in the art to generate such de„enetate 

variants. 
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The invention also provides nucleic acid molecules having sequences 
complementary to any one of those described in Table I. Such isolated 
molecules, particularly DNA molecules, am useful as probes for detecting 

expression of Streptococcal genes, for instance, by Northern blot analysis or the 
polymerase chain reaction (PCR). 

The present invention is further directed to fragments of the isolated 
nucleic acid molecules described herein. By a fragment of an isolated nucleic 
acid molecule having a nucleotide sequence described in Table 1, is intended 
fragments a. leas, about 15 m, and more prefetabiy at leas, about 17 still 
more preferably a. leas, about 20 and even more preferably, a. leas, about 25 
n. ,n length which are useful as diagnostic probes and primers as discussed 
herein. Of course, larger fragments 50-100 nt in length are also useful 
according to the present invention as are fragments corresponding to most if 
not all, of a nucleotide sequence described in Table I. By a fragment a. least’20 
nt in length, for example, is intended fragments which include 20 or more 
contiguous bases of a nucleotide sequence as described in Table 1. Since the 
nucleotide sequences identified in Table I are provided as SEQ O NO: 1 SEO 
ID NO:3, SEQ ID NO:5, and so on through SEQ ID NO:225, generating such 

A fragments would be routine to the skilled artisan. For example, such 
fragments could be generated synthetically. 

Preferred nucleic acid fragments of the present invention also include 
nucleic acid molecules comprising nucleotide sequences encodinv 
epitope-beanng portions of the S. pneumoniae polypeptides identified in Table 
Such nucleic acid fragments of the present invention include, for example 
nucleotide sequences encoding polypeptide fragments comprising from about 
*e amino tetmina! residue to about the carboxy temunal residue of each 
fragment shown in Table 2. The above referred ,0 polypeptide fragments are 
antigenic regions of the 5. pneumoniae polypeptides identified in Table l. 

In another aspect, the invention provides isolated nucleic acid molecules 

comprising polynucleotides which hybridize under stringent hybridization 

conditions to a portion of a polynucleotide in a nucleic acid molecule of the 

invention described above, for instance, a nucleic acid sequence identified in 

able 1. By "stringent hybridization conditions" is intended overnight 

incubation at 42 C in a solution comprising: 50% formamide, 5x SSC (150 mM 

NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x 

Denhardfs solution, 10% dextran sulfate, and 20 g/ml denatured, sheared 

salmon sperm DNA, followed by washing the filters in O.lx SSC at about 
65 C. 
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By polynucleotides which hybridize to a "portion" of a polynucleotide is 
intended polynucleotides (either DNA or RNA) which hybridize to at least about 
15 nucleotides (nt), and more preferably at least about 17 nt, still more 
preferably at least about 20 nt, and even more preferably about 25-70 nt o e 
reference polynucleotide. These are useful as diagnostic probes and primers as 

discussed above and in more detail below. 

Of course, polynucleotides hybridizing to a larger portion o t e 
reference polynucleotide, for instance, a portion 50-100 nt in length, or even to 
the entire length of the reference polynucleotide, are also useful as probes 
according to the present invention, as are polynucleotides corresponding to 
most, if not all, of a nucleotide sequence as identified in Table 1 . By a portion 
of a polynucleotide of "at least 20 nt in length," for example, is intended 20 or 
more contiguous nucleotides from the nucleotide sequence of the reference 
polynucleotide 0 e.g ., a nucleotide sequences as described in Table 1). As note 
above, such portions are useful diagnostically either as probes according to 
conventional DNA hybridization techniques or as primers for amplification 
target sequence by PCR, as described in the literature (for instance, in Molecular 
Cloning, A Laboratory Manual, 2nd. edition, Sambrook, J-, Fntsch, E. F. an 
Maniatis, T., eds.. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
N.Y. (1989), the entire disclosure of which is hereby incorporated herein by 

reference). 

Since nucleic acid sequences encoding the S. pneumoniae polypeptides 
of the present invention are identified in Table 1 and provided as SEQ ID NO: , 
SEQ ID NO-3 SEQ ID NO:5, and so on through SEQ ID NO:225, generating 
polynucleotides which hybrid, ze to portions of these sequences would be 
routine to the skilled artisan. For example, die hybridizing polynucleotides of 
the present invention could be generated synthetically according to nown 

techniques. - , . , 

As indicated, nucleic acid molecules of the present mventron whieh 

encode S. pneumoniae polypeptides of the present invention may include, but 

are not limited to those encoding the amino acid sequences of the polypepudes 

by themselves; and additional coding sequences which code for a tuon 

amino acids, such as those which provide additional functional, ties. Thus, dm 

sequences encoding these polypeptides may be fused to a marker sequence 

such as a sequence encoding a peptide which facilitates punficauon of the fused 

polypeptide. In certain preferred embodiments of this aspect of the invention, 

the marker amino acid sequence is a he«-his.idine pept.de, such as the tag 

provided in a pQE vector (Qiagen, Inc.), among others, many of w ic ore 
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commerciaHy available. As described by Geniz and colleagues (P mc . NaIl 

ca . a. USA 86:821-824 (1989)), for instance, hexa-histidine provides for 
convenient purification of the resulting fusion protein. 

Thus, the present invention also includes genetic fusions wherein the 5 
pneumomae nuclerc acid sequences coding sequences identified in Table 1 am 
inked to additional nucleic acid sequences to produce fusion proteins These 
fusion proteins may include epitopes of streptococcal or hon-s, rep ,ococcal 
origin designed to produce proteins having enhanced immunogenicity. Further 
the fusion proteins of the present invention may contain antigenic determinants’ 
known to provide helper T-cell stimulation, peptides encoding sites for 
post-translational modifications which enhance immunogenicity 
acylation), peptides which facilitate purification (e.g., histidine "tag"), or amino 

acd sequences which target die fusion protein to a desired location a 

heterologous leader sequence). 

In all cases of bacterial expression, an N-terminal methionine residues is 
added. In many cases, however, the N-terminal methionine residues is cleaved 
off post-translaiionally. Thus, the invention includes polypeptides shown in 
I able 1 with, and without an N-termainal methionine. 

The present invention thus includes nucleic acid molecules and 
sequences which encode fusion proteins comprising one or more J 
pneumoniae polypeptides of die present invention fused ,o an amino acid 
sequence which allows for post-translational modification to enhance 
immunogenicity. This post-transladonal modification may occur either in viiro 
or w en the fusion protein is expressed in vivo in a host cell. An example of 

sue a modification is the introduction of an amino acid sequence which results 

in the attachment of a lipid moiety. 

Thus, as indicated above, the present invention includes genedc fusions 
Wherein a S. pneiunoniae nucleic acid sequence identified in Table 1 is linked to 
a nucleotide sequence encoding another amino acid sequence. These other 
am.no acd sequences may be of streptococcal origin (e.g., another sequence 
selected from Table 1) or non-streptococcal origin. 

The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode portions, analogs or 
envatives of the S. pneumoniae polypeptides described in Table I. Variants 
may occur naturally, such as a natural allelic variant. By an "allelic variant" is 
intended one of several alternate forms of a gene occupying a given locus on a 
chromosome of an organism (Genes It, Lewin, B„ ed„ John Wiley & Sons 
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New York (1985)). Non-naturally occurring variants may be produced using 
art-known mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, 
deletions or additions. The substitutions, deletions or additions may involve 
one or more nucleotides. These variants may be altered in coding regions, 
non-coding regions, or both. Alterations in the coding regions may produce 
conservative or non-conservative amino acid substitutions, deletions or 
additions. Especially preferred among these are silent substitutions, additions 
and deletions, which do not alter the properties and activities of the S. 
pneumoniae polypeptides disclosed herein or portions thereof. Silent 
substitution are most likely to be made in non-epitopic regions. Guidance 
regarding those regions containing epitopes is provided herein, for example, in 
Table 2. Also especially preferred in this regard are conservative substitutions. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 
90% identical v and more preferably at least 95%, 96%, 97%, 98% or 99% 
identical to: (a) a nucleotide sequence encoding any of the amino acid sequences 
of the polypeptides identified in Table 1; and (b) a nucleotide sequence 
complementary to any of the nucleotide sequences in (a) above. 

By a polynucleotide having a nucleotide sequence at least, for example, 
95% "identical" to a reference nucleotide sequence encoding a S. pneumoniae 
polypeptide described in Table 1, is intended that the nucleotide sequence of the 
polynucleotide is identical to the reference sequence except that the 
polynucleotide sequence may include up to five point mutations per each 100 
nucleotides of the reference nucleotide sequence encoding the subject 5. 
pneumoniae polypeptide. In other words, to obtain a polynucleotide having a 
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up 
to 5% of the nucleotides- in the reference sequence may be deleted or substituted 
with another nucleotide, or a number of nucleotides up to 5% of the total 
nucleotides in the reference sequence may be inserted into the reference 
sequence. These mutations of the reference sequence may occur at the 5’ or 3’ 
terminal positions of the reference nucleotide sequence or anywhere between 
those terminal positions, interspersed either individually among nucleotides in 
the reference sequence or in one or more contiguous groups within the reference 

sequence. 

Certain nucleotides within some of the nucleic acid sequences shown m 
Table 1 were ambiguous upon sequencing. Completely unknown sequences are 
shown as an “N”. Other unresolved nucleotides are known to be either a 
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purine, shown as “R”, or a pyrimidine, shown as “Y”. Accordingly, when 
determining identity between two nucleotide sequences, identity is met where 
any nucleotide, including an “R”, “Y” or “N”, is found in a test sequence and at 
the corresponding position in the referece sequence (from Table 1). Likewise, 
an A, G or “R” in a test sequence is identical to an “R” i n the reference 

sequence; and a T, C or “Y” in a test sequence is identical to a “Y” in the 
reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 90%, 95%, 96%, 97%, 98% or 99% identical to, for instance, a nucleotide 
sequence described in Table 1 can be determined conventionally using known 
computer programs such as the Bestfit program (Wisconsin Sequence Analysis 
Package, Version 8 for Unix; Genetics Computer Group, University Research 
Park, 575 Science Drive, Madison, WI 53711). Bestfit uses the local 
homology algorithm of Smith and Waterman (Advances in Applied Mathematics 
2:482-489 (1981)), to find the best segment of homology between two 
sequences. WJien using Bestfit or any other sequence alignment program to 
determine whether a particular sequence is, for instance, 95% identical to a 
reference sequence according to the present invention, the parameters are set, of 
course, such that the percentage of identity is calculated over the full length of 
the reference nucleotide sequence and that gaps in homology of up to 5% of the 
total number of nucleotides in the reference sequence are allowed. 


The present application is directed to nucleic acid molecules at least 
90%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequences 
described m Table 1. One of skill in the art would still know how to use the 
nucleic acid molecule, for instance, as a hybridization probe or a polymerase 
chain reaction (PCR) primer. Uses of the nucleic acid molecules of the present 
invention include, inter alia, (1) isolating Streptococcal genes or allelic variants 
thereof from either a genomic or cDNA libraiy and (2) Northern Blot or PCR 
analysis for detecting Streptococcal mRNA expression. 

Of course, due to the degeneracy of the genetic code, one of ordinary 
skill m the art will immediately recognize that a large number of nucleic acid 
molecules having a sequence at least 90%, 95%, 96%, 97%, 98%, or 99% 
identical to a nucleic acid sequence identified in Table 1 will encode the same 
polypeptide. In fact, since degenerate variants of these nucleotide sequences all 
encode the same polypeptide, this will be clear to the skilled artisan even 
without performing the above described comparison assay. 

It will be further recognized in the art that, for such nucleic acid 
molecules that are not degenerate variants, a reasonable number will also encode 



WO 98/18930 


PCT/US97/19422 


proteins having antigenic epitopes of the 5. pneumoniae polypeptides of the 
present invention. This is because the skilled artisan is fully aware of amino 
acid substitutions that are either less likely or not likely to significantly effect the 
antigenicity of a polypeptide (e.*.. replacement of an amino acid m a region 
which is not believed to form an antigenic epitope). For example, since 
antigenic epitopes have been identified which contain as few as six amino acids 
(see Harlow, et al.. Antibodies: A Laboratory Manual, 2nd Ed.; Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York (1988), page 76), in 
instances where a polypeptide has multiple antigenic epitopes the alteration 
several amino acid residues would often not be expected to eliminate all of the 
antigenic epitopes of that polypeptide. This is especially so when the alterations 
are in regions believed to not constitute antigenic epitopes. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of S. pneumoniae 
polypeptides or fragments thereof by recombinant techniques. 

Recombinant constructs may be introduced into host cells using well 
known techniques such as infection, transduction, transfection, transvection, 
electroporation and transformation. The vector may be, for example, a phage, 
plasmid, viral or retroviral vector. Retroviral vectors may be replication 
competent or replication defective. In the latter case, viral propagation generally 

will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable 

marker for propagation in a host. Generally, a plasmid vector is introduced i 
precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 

appropriate packaging cell line and then transduced into host cells. 

Preferred are vectors comprising exacting control regions to e 
polynucleotide of interest. Appropriate , reacting factors may be supplied by 
the host, supplied by a complementing vector or supplied by the vector ttsel 

upon introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for 
specific expression, which may be inducible and/or cell type-specific 
Particularly preferred among such vectors are those inducible by environmental 
factors that are easy to manipulate, such as temperature and nutrient additives. 
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Expression vectors useful i„ lhe pr e sent h, venlion ; 
chromosomal-, episomal- and vtrus-dertved vectors, ,g„ vectom dertved from 
enal plasm, ds, bacteriophage, yeast episomes, yeas, chromosomal 
elements, v, ruses such as baculovinises, papova viruses, vaccinia viruses 
adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses and 
vectors den ved from combinations thereof, such as costmds and phagemids. 

such as the h * operative, 5 r li"ked >o an appropriate promoter, 

TsC o ear 8 d ^ P ~ £ « Promoters. 

few Oth h 316 Pr0m °‘ erS ^ Pf0m0KrS ° f retroVinl1 LTRs - <° * 

few. outer suttable promoter will be known to the skilled artisan The 

express, on const™, s will funher contain sites for nanscription initiation 

~ vr ■“ ^ «*-■« fo ; 

o m = portion of the mature transcripts expressed by the 
onstrucs w,„ preferably tnclude a Nation mitiatmg site a, the beginning and 

— T° T A ' UGA or UAG) appr ™ « * - 

of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase or neomyl 
resistance for euka^otic cell culture and tetracycline or ampictllin resist!" 
genes for culturing £ coll and other bacteria. Representative examples of 
appropriate hosts include, bu, are no, limited to, bacterial cells, such as £ coll 
Joyces and Salmonella , yphmuriao, cells: fungal cells, such as yeas,’ 

’ “ ,S “ :tceUs such “ Drosophila S2 and Spodopura Sf9 cells: animal cells 
such as CHO, COS and Bowes melanoma cells: and plan, cells. Appropriate 

culture mediums and conditions for the above-described host cells are known 

ine art. 

doe <j Am °";, V ? 0rS preferred f0r use bllc,eria include pQE70, pQE60 and 
PQE-9, available from Qiagen; pBS vectors; Phagescrip, vectors. Bluescript 

vectors. pNHSA, pNH16a, pNHISA, pNH46A available from Stratagene- pCT 
senes of vectors available from Novagen: and ptreSSa, pKK223-3 pK K 033 3 
pDR540, P RfP5 available fmm Pharmacta. Anjg 
vectors are pWLNEO, pSV2CAT, pOG44, pXTl and pSO avmlableTrom 
Stratagene; and pSVK3, pBPV, pMSO and pSVL available from Phtumacia. 
Other suitable vectots will be readily apparent to die skilled anisan. 

Among known bacterial promoters suitable for use in the present 
invention tnclude Ute £. coli lad and lacZ promorers, die T3 and T7 promoters 

stH e7 0,er ' PR “* PL - - «P P~ 

eukaryotic promoters include the CMV immediate early promoter, die 
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HSV thymidine kinase promoter, the early and late SV40 promoters the 
promoters of retroviral LTRs, such as those of the Rous sarcoma virus ( 
and metallothionein promoters, such as the mouse metallothionein-I promoter. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic 
lipid-mediated transfection, electroporation, transduction, infection or other 
methods. Such methods are described in many standard laboratory manuals 
(for example, Davis, et al., Basic Methods In Molecular Biology (1986)). 

Transcription of DNA encoding the polypeptides of the present 
invention by higher eukaryotes may be increased by inserting an enhancer 
sequence into the vector. Enhancers are m-acting elements of DNA, usually 
about from 10 to 300 bp that act to increase transcriptional activity of a promoter 
in a given host cell-type. Examples of enhancers include the SV40 enhancer, 
which is located on the late side of the replication origin at bp 100 to 270, the 
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side 

of the replication origin, and adenovirus enhancers. 

For secretion of the translated polypeptide into the lumen of the 
endoplasmic reticulum, into the penplasmic space or into the extracellular 
environment, appropriate secretion signals may be incorporated into the 
expressed polypeptide. The signals may be endogenous to the polypeptide or 
they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as a usion 
protein, and may include not only secretion signals, but also additional 
heterologous functional regions. For instance, a region of additional amino 
acids, particularly charged amino acids, may be added to the N-.erminus of the 
polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties 
may be added to the polypeptide to facilitate purification. Such regions may be 
removed prior to final preparation of die polypeptide. The addltton of peptide 
moiefies to polypeptides to engender secretion or excretion, to imptove stability 
and to facilitate purification, among others, are familiar and routine techniques 
in the art. A preferred fusion protein comprises a heterologous region from 
immunoglobulin that is useful to solubilize proteins. For example, EP-A-O 464 
533 (Canadian counteipart 2045869) discloses fusion proteins comprising 
various portions of constant region of immunoglobin molecules together with 
another human protein or part thereof. In many cases, the Fc part in a usion 
protein is thoroughly advantageous for use in therapy and diagnosis " 
results, for example, in improved pharmacokinetic properties (EP-A 0232 - .). 
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On .he otter hand, for some uses i. would be desirable .0 be able .0 delete the 
c par. after .he fus,on prolein has been expressed, deteeted and purified in tte 
advantageous manner described. This is tte case when Fc ponton proves .0 be 
an ranee to use in therapy and diagnosis, for example when .he fusion 
P tern ,s .0 be used as aniigen for immunizations. In drug discovery for 
example, human prole, ns, such as, hE.5-recep.or has been fused with Fc 
pomons for tte purpose of high-rhroughpu. screening assays .0 identify 
an agoms.s of h,L-5. gee Bennett, D. e,nf„ 7. Mole, R eco g , ,. 8:52- 5 8 
an o anson, K. er «/., 7. Biol. Chem. 270 (/r5J:9459-947l (1995) 

The 5. pneumoniae polypeptides can be recovered and purified from 
recombinant cell cultures by well-known methods including ammonium sulfate 
or ethanol precip, union, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography 
lecun chromatography and high performance liquid chromatography (”HPLC")' 

IS employed fpr purification. Polypeptides of the present invemron include 
naturally p „ nfi ed products, products of chemical synthetic procedures and 
pro nets produced by recombinant techniques from a prokruyotic or eukmyotic 
host, rncludrng, for example, bacterial, yeast, higher plan,, ,„ sec , ^ 
mammalian cells. 


Polypeptides and Fragments 

The invention further provides isolated polypeptides having the amino 

aetd sequences described in Table 1, and shown as SEQ ID NO:2, SEO E> 

N0:4, SEQ B N0:6, and so on through SEQ B NO:226, and peptides or 

poypepndes comprising portions of the above polypeptides. The tetms 

pcpt.de and "oligopeptide- are considered synonymous (as is commonly 

recognized) and each term can be used interchangeably as the context requires to 

indicate a charn of at least two amino acids coupled by peptidyl linkages. The 

wor polypepude" ts used herein for chains containing more than ten amino 

acid res, dues. All ohgopeptide and polypeptide formulas or sequences herein 

are written from left ,0 right and in the direction from amino terminus .0 ctuboxy 
terminus. y 

Some amino acid sequences of tte J. pneumoniae polypeptides 
esen e tn Table 1 can be varied without significantly effecting tte antigenicity 
of tte polypeptides. If such deferences in sequence are contemplated, i, should 
be remembered that there win be critical areas on tte polypeptide which 
detemune anttgenicuy. In general, i, is possible to replace residues which do 
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not form part of an antigenic epitope without significantly effecting the 
antigenicity of a polypeptide. Guidance for such alterations is given in Table 2 

wherein epitopes for each polypeptide is delineated. 

The polypeptides of the present invention are preferably provided in an 
isolated form. By “isolated polypeptide” is intended a polypeptide removed 
from its native environment. Thus, a polypeptide produced and/or contained 
within a recombinant host cell is considered isolated for purposes of the present 
invention. Also intended as an “isolated polypeptide” is a polypeptide that has 
been purified, partially or substantially, from a recombinant host cell. For 
example, recombinantly produced versions of the S. pneumoniae polypeptides 
described in Table 1 can be substantially purified by the one-step method 

described by Smith and Johnson ( Gene 67:3 1-40 ( 1988)). 

The polypeptides of the present invention include: (a) an amino acid 

sequence of any of the polypeptides described in Table 1; and (b) an amino acid 

sequence of an epitope-bearing portion of any one of the polypeptides of (a), as 
well as polypeptides with at least 70% similarity, and more preferably at least 
75% 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those 
described in (a) or (b) above, as well as polypeptides having an ammo acid 
sequence at least 70% identical, more preferably at least 75% identical, and still 
more preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to 

those above. 

By "% similarity" for two polypeptides is intended a similarity score 
produced by comparing the amino acid sequences of the two polypeptides using 
the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for 
Unix, Genetics Computer Group, University Research Park, 575 Science 
Drive, Madison, WI 53711) and the default settings for determining similarity. 
Bestfit uses the local homology algorithm of Smith and Waterman {Advances in 
Applied Mathematics 2:482-489 (1981)) to find the best segment of similarity 

between two sequences. 

By a polypeptide having an amino acid sequence at least, for example, 
95% "identical" to a reference amino acid sequence of a S. pneumoniae 
polypeptide is intended that the amino acid sequence of the polypeptide is 
identical to the reference sequence except that the polypeptide sequence m y 
include up to five amino acid alterations per each 100 amino acids of the 
reference amino acid sequence. In other words, to obtain a polypeptide having 
an amino acid sequence at least 95% identical to a reference amino acid 
sequence, up to 5% of the amino acid residues in the reference sequence may be 
deleted or substituted with another amino acid, or a number of amino acids up to 
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5% of the total amino acid residues in the reference sequence may be inserted 
into the reference sequence. These alterations of the reference sequence may 
occur at the amino or carboxy terminal positions of the reference amino acid 
sequence or anywhere between those terminal positions, interspersed either 
md,v, dually among residues in the reference sequence or in one or more 
contiguous groups within the reference sequence. 

The amino acid sequences shown in Table 1 may have on or more “X” 
residues. “X” represents unknown. Thus, for purposes of defining identity, if 
any amino acid is present at the same position in a reference amino acid 

sequence (shown in Table 1) where an X is shown, the two sequences are 
identical at that position. 

As a practical matter, whether any particular polypeptide is at least 70%, 
75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to, for 
instance, an amino acid sequence shown in Table 1, can be determined 
conventionally using known computer programs such the Bestfit program 
(Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics 
Computer Group, University Research Park, 575 Science Drive, Madison, WI 
53711). When using Bestfit or any other sequence alignment program to 
determine whether a particular sequence is, for instance, 95% identical to a 
reference sequence according to the present invention, the parameters are set, of 
course, such that the percentage of identity is calculated over the full length of 
the reference amino acid sequence and that gaps in homology of up to 5% of the 
total number of amino acid residues in the reference sequence are allowed. 

As described below, the polypeptides of the present invention can also 
be used to raise polyclonal and monoclonal antibodies, which are useful in 
assays for detecting Streptococcal protein expression. 

In another aspect, the invention provides peptides and polypeptides 
comprising epitope-bearing portions of the 5. pneumoniae polypeptides of the 
invention. These epitopes are immunogenic or antigenic epitopes of the 
polypeptides of the invention. An "immunogenic epitope" is defined as a part of 
a protein that elicits an antibody response when the whole protein or polypeptide 
is the immunogen. These immunogenic epitopes are believed to be confined to 
a few loci on the molecule. On the other hand, a region of a protein molecule to 
which an antibody can bind is defined as an "antigenic determinant" or 
antigenic epitope." The number of immunogenic epitopes of a protein 
generally is less than the number of antigenic epitopes (Geysen, et al, Proc 

Natl. Acad. Sci. USA 81:3998- 4002 (1983)). Predicted antigenic epitopes are 
shown in Table 2, below. 
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As to the selection of peptides or polypeptides bearing an antigenic 
epitope (£.«?., that contain a region of a protein molecule to which an antibody 
can bind), it is well known in that art that relatively short synthetic peptides that 
mimic part of a protein sequence are routinely capable of eliciting an antiserum 
that reacts with the partially mimicked protein (for instance, Sutcliffe, J., et al. 
Science 219:660-666 (1983)). Peptides capable of eliciting protein-reactive 
sera are frequently represented in the primary sequence of a protein, can be 
characterized by a set of simple chemical rules, and are confined neither to 
immunodominant regions of intact proteins (i.e., immunogenic epitopes) nor to 
the amino or carboxyl terminals. Peptides that are extremely hydrophobic and 
those of six or fewer residues generally are ineffective at inducing antibodies 
that bind to the mimicked protein; longer, peptides, especially those containing 
proline residues, usually are effective (Sutcliffe, et al., supra, p. 661). For 
instance, 1 8 of 20 peptides designed according to these guidelines, containing 
8-39 residues covering 75% of the sequence of the influenza virus 
hemagglutinin HA 1 polypeptide chain, induced antibodies that reacted with the 
HA1 protein or intact vims; and 12/12 peptides from the MuLV polymerase and 
18/18 from the rabies glycoprotein induced antibodies that precipitated the 
respective proteins. 

Antiaenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 
specifically to a polypeptide of the invention. Thus, a high proportion of 
hybridomas obtained by fusion of spleen cells from donors immunized with an 
antigen epitope-bearing peptide generally secrete antibody reactive with the 
native protein (Sutcliffe, et al., supra, p. 663). The antibodies raised by 
antigenic epitope-bearing peptides or polypeptides are useful to detect the 
mimicked protein, and antibodies to different peptides may be used for tracking 
the fate of various regions of a protein precursor which undergoes 
post-translational processing. The peptides and anti-peptide antibodies may be 
used in a variety of qualitative or quantitative assays for the mimicked protein, 
for instance in competition assays since it has been shown that even short 
peptides (e.g., about 9 amino acids) can bind and displace the larger peptides in 
immunoprecipitation assays (for instance, Wilson, et al, Cell 37:767-778 
(1984) p. 777). The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption 
chromatography using methods well known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
designed according to the above guidelines preferably contain a sequence of at 
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least seven, more preferably at least nine and most preferably between about 15 
to about 30 amino acids contained within the amino acid sequence of a 
polypeptide of the invention. However, peptides or polypeptides comprising a 
larger portion of an amino acid sequence of a polypeptide of the invention, 
containing about 30 to about 50 amino acids, or any length up to and including 
the entire amino acid sequence of a polypeptide of the invention, also are 
considered epitope-bearing peptides or polypeptides of the invention and also 
are useful for inducing antibodies that react with the mimicked protein. 
Preferably, the amino acid sequence of the epitope-bearing peptide is selected to 
provide substantial solubility in aqueous solvents ( i.e ., the sequence includes 
relatively hydrophilic residues and highly hydrophobic sequences are preferably 
avoided); and sequences containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be 
used to generate Streptococcal - specific antibodies include portions of the amino 
acid sequences identified in Table 1. More specifically. Table 2 discloses 
antigenic fragments of polypeptides of the present invention, which antigenic 
fragments comprise amino acid sequences from about the first amino acid 
residues indicated to about the last amino acid residue indicated for each 
fragment. The polypeptide fragments disclosed in Table 2 are believed to be 
antigenic regions of the S. pneumoniae polypeptides described in Table 1. Thus 
the invention further includes isolated peptides and polypeptides comprising an 
amino acid sequence of an epitope shown in Table 2 and polynucleotides 
encoding said polypeptides. 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means for making peptides or polypeptides 
including recombinant means using nucleic acid molecules of the invention. For 
instance, an epitope-bearing amino acid sequence of the present invention may 
be fused to a larger polypeptide which acts as a carrier during recombinant 
production and purification, as well as during immunization to produce 
anti-peptide antibodies. Epitope-bearing peptides also may be synthesized using 
known methods of chemical synthesis. For instance, Houghten has described a 
simple method for synthesis of large numbers of peptides, such as 10-20 mg of 
248 different 13 residue peptides representing single amino acid variants of a 
segment of the HA1 polypeptide which were prepared and characterized (by 
ELISA-type binding studies) in less than four weeks (Houghten, R. A. Proc. 
Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple 
Peptide Synthesis (SMPS)" process is further described in U.S. Patent No. 
4,631,21 1 to Houghten and coworkers (1986). In this procedure the individual 
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resins for the solid-phase synthesis of various peptides are contained in separate 
solvent-permeable packets, enabling the optimal use of the many identical 
repetitive steps involved in solid-phase methods. A completely manual 
procedure allows 500-1000 or more syntheses to be conducted simultaneously 
(Houghten, et al., supra, p. 5134). 

Epitope-bearing peptides and polypeptides of the invention are used to 
induce antibodies according to methods well known in the art (for instance, 
Sutcliffe, et al., supra', Wilson, et al., supra', Chow, M., et al., Proc. Natl. 
Acad. Sci. USA 82:910-914; and Bittle, F. J., et al., J. Gen. Virol. 
66:2347-2354 (1985)). Generally, animals may be immunized with free 
peptide; however, anti-peptide antibody titer may be boosted by coupling of the 
peptide to a macromolecular carrier, such as keyhole limpet hemacyanin (KLH) 
or tetanus toxoid. For instance, peptides containing cysteine may be coupled to 
carrier using a linker such as m-maleimidobenzoyl-N-hydroxysuccinimide ester 
(MBS), while other peptides may be coupled to carrier using a more general 
linking agent such as glutaraldehyde. Animals such as rabbits, rats and mice are 
immunized with either free or carrier-coupled peptides, for instance, by 
intraperitoneal and/or intradermal injection of emulsions containing about 100 
pg peptide or carrier protein and Freund's adjuvant. Several booster injections 
may be needed, for instance, at intervals of about two weeks, to provide a 
useful titer of anti-peptide antibody which can be detected, for example, by 
ELISA assay using free peptide adsorbed to a solid surface. The titer of 
anti-peptide antibodies in serum from an immunized animal may be increased by 
selection of anti-peptide antibodies, for instance, by adsorption to the peptide on 
a solid support and elution of the selected antibodies according to methods well 
known in the art. 

Immunogenic epitope-bearing peptides of the invention, i.e., those parts 
of a protein that elicit an antibody response when the whole protein is the 
immunogen, are identified according to methods known in the art. For 
instance, Geysen, et al., supra, discloses a procedure for rapid concurrent 
synthesis on solid supports of hundreds of peptides of sufficient purity to react 
in an enzyme-linked immunosorbent assay. Interaction of synthesized peptides 
with antibodies is then easily detected without removing them from the support. 
In this manner a peptide bearing an immunogenic epitope of a desired protein 
may be identified routinely by one of ordinary skill in the art. For instance, the 
immunologically important epitope in the coat protein of foot-and-mouth disease 
virus was located by Geysen et al. supra with a resolution of seven amino acids 
by synthesis of an overlapping set of all 208 possible hexapeptides covering the 
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entire 213 amino acid sequence of the protein. Then, a complete replacement set 
of peptides in which all 20 amino acids were substituted in turn at every position 
within the epitope were synthesized, and the particular amino acids conferring 
specificity for the reaction with antibody were determined. Thus, peptide 
5 analogs of the epitope-bearing peptides of the invention can be made routinely 

by this method. U.S. Patent No. 4,708,781 to Geysen (1987) further describes 
this method of identifying a peptide bearing an immunogenic epitope of a 
desired protein. 

Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a 
10 general method of detecting Or determining the sequence of monomers (amino 

acids or other compounds) which is a topological equivalent of the epitope ( i.e ., 
a "mimotope") which is complementary to a particular paratope (antigen binding 
site) of an antibody of interest. More generally, U.S. Patent No. 4,433,092, 
also to Geysen (1989), describes a method of detecting or determining a 
15 sequence of monomers which is a topographical equivalent of a ligand which is 

complementary to the ligand binding site of a particular receptor of interest. 
Similarly, U.S. Patent No. 5,480,971 to Houghten, R. A. et al. (1996) 
discloses linear CNC^-alkyl peralkylated oligopeptides and sets and libraries of 

such peptides, as well as methods for using such oligopeptide sets and libraries 
20 for determining the sequence of a peralkylated oligopeptide that preferentially 

binds to an acceptor molecule of interest. Thus, non-peptide analogs of the 
epitope-bearing peptides of the invention also can be made routinely by these 
methods. 

The entire disclosure of each document cited in this section on 
25 "Polypeptides and Fragments" is hereby incorporated herein by reference. 

As one of skill in the art will appreciate, the polypeptides of the present 
invention and the epitope-bearing fragments thereof described above can be 
combined with parts of the constant domain of immunoglobulins (IgG), 
resulting in chimeric polypeptides. These fusion proteins facilitate purification 
30 and show an increased half-life in vivo. This has been shown, e.g., for 

chimeric proteins consisting of the . first two domains of the human 
CD4-polypeptide and various domains of the constant regions of the heavy or 
light chains of mammalian immunoglobulins (EPA 0,394,827; Traunecker et 
al.. Nature 331: 84-86 (1988)). Fusion proteins that have a disulfide-linked 
35 dimeric structure due to the IgG part can also be more efficient in binding and 

neutralizing other molecules than a monomeric S. pneumoniae polypeptide or 
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fragment thereof alone (Fountoulakis et al., J. Biochem. 270: 3958-3964 
(1995)). 

Diagnostic Assays 

5 The present invention further relates to a method for assaying for 

Streptococcal infection in an animal via detecting the expression of genes 
encoding Streptococcal polypeptides (e.g., the polypeptides described Table 1). 
This method comprises analyzing tissue or body fluid from the animal for 
Streptococcus-specific antibodies or Streptococcal nucleic acids or proteins. 
10 Analysis of nucleic acid specific to Streptococcus can be done by PCR or 

hybridization techniques using nucleic acid sequences of the present invention 
as either hybridization probes or primers (cf Molecular Cloning: A Laboratory 
Manual, second edition, edited by Sambrook, Fritsch, & Maniatis, Cold Spring 
Harbor Laboratory, 1989; Eremeeva et al, J. Clin. Microbiol. J2; 803-8 10 
15 (1994) which describes differentiation among spotted fever group Rickettsiae 

species by analysis of restriction fragment length polymorphism of 
PCR-amplified DNA). Methods for detecting B. burgdorferi nucleic acids via 
PCR are described, for example, in Chen et al, J. Clin. Microbiol. 32: 589-595 
(1994). 

20 Where diagnosis of a disease state related to infection with 

Streptococcus has already been made, the present invention is useful for 
monitoring progression or regression of the disease state whereby patients 
exhibiting enhanced Streptococcus gene expression will experience a worse 
clinical outcome relative to patients expressing these gene(s) at a lower level. 

25 By "assaying for Streptococcal infection in an animal via detection of 

genes encoding Streptococcal polypeptides" is intended qualitatively or 
quantitatively measuring or estimating the level of one or more Streptococcus 
polypeptides or the level of nucleic acid encoding Streptococcus polypeptides in 
a first biological sample either direcdy (e.g. , by determining or estimating 
30 absolute protein level or nucleic level) or relatively (e.g., by comparing to the 

Streptococcus polypeptide level ormRNA level in a second biological sample). 
The Streptococcus polypeptide level or nucleic acid level in the second sample 
used for a relative comparison may be undetectable if obtained from an animal 
which is not infected with Streptococcus. When monitoring the progression or 
35 regression of a disease state, the Streptococcus polypeptide level or nucleic acid 

level may be compared to a second sample obtained from either an animal 
infected with Streptococcus or the same animal from which the first sample was 
obtained but taken from that animal at a different time than the first. As will be 
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appreciated in the art, once a standard Streptococcus polypeptide level or nucleic 
acid level which corresponds to a particular stage of a Streptococcus infection is 
known, it can be used repeatedly as a standard for comparison. 

By "biological sample" is intended any biological sample obtained from 
an animal, cell line, tissue culture, or other source which contains Streptococcus 
polypeptide, mRNA, or DNA. Biological samples include body fluids (such as 
plasma and synovial fluid) which contain Streptococcus polypeptides, and 
muscle, skin, and cartilage tissues. Methods for obtaining tissue biopsies and 
body fluids are well known in the art. 

The present invention is useful for detecting diseases related to 
Streptococcus infections in animals. Preferred animals include monkeys, apes, 
cats, dogs, cows, pigs, mice, horses, rabbits and humans. Particularly 
preferred are humans. 

Total RNA can be isolated from a biological sample using any suitable 
technique such as the single-step guanidinium-thiocyanate-phenol-chloroform 
method described in Chomczynski and Sacchi, Anal. Biochem. 162: 156-159 
(1987). mRNA encoding Streptococcus polypeptides having sufficient 
homology to the nucleic acid sequences identified in Table 1 to allow for 
hybridization between complementary sequences are then assayed using any 
appropriate method. These include Northern blot analysis, SI nuclease 
mapping, the polymerase chain reaction (PCR), reverse transcription in 
combination with the polymerase chain reaction (RT-PCR), and reverse 
transcription in combination with the ligase chain reaction (RT-LCR). 

Northern blot analysis can be performed as described in Harada et al„ 
Cell 63: 303-312 (1990). Briefly, total RNA is prepared from a biological 
sample as described above. For the Northern blot, the RNA is denatured in an 
appropriate buffer (such as glyoxal/dimethyl sulfoxide/sodium phosphate 
buffer), subjected to agarose gel electrophoresis, and transferred onto a 
nitrocellulose filter. After the RNAs have been linked to the filter by a UV 
linker, the filter is prehybridized in a solution containing formamide, SSC, 
Denhardt's solution, denatured salmon sperm, SDS, and sodium phosphate 
buffer. A S. pnuemoniae polypeptide DNA sequence shown in Table 1 labeled 
according to any appropriate method (such as the 32 P-multiprimed DNA labeling 
system (Amersham)) is used as probe. After hybridization overnight, the filter 
is washed and exposed to x-ray film. DNA for use as probe according to the 
present invention is described in the sections above and will preferably at least 
15 bp in length. 
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SI mapping can be performed as described in Fujita et aL , Cell 
49: 357-367 (1987). To prepare probe DNA for use in SI mapping, the sense 
strand of an above-described S. pnuemoniae DNA sequence of the present 
invention is used as a template to synthesize labeled antisense DNA. The 
antisense DNA can then be digested using an appropriate restriction 
endonuclease to generate further DNA probes of a desired length. Such 
antisense probes are useful for visualizing protected bands corresponding to the 
target mRNA (/.e., mRNA encoding Streptococcus polypeptides). 

Preferably, levels of mRNA encoding Streptococcus polypeptides are 
assayed using the RT-PCR method described in Makmo et aL, 
Technique 2:295-301 (1990). By this method, the radioactivities of the 
"amplicons” in the polyacrylamide gel bands are linearly related to the initial 
concentration of the target mRNA. Briefly, this method involves adding total 
RNA isolated from a biological sample in a reaction mixture containing a RT 
primer and appropriate buffer. After incubating for primer annealing, the 
mixture can be supplemented with a RT buffer, dNTPs, DTT, RNase inhibitor 
and reverse transcriptase. After incubation to achieve reverse transcription of 
the RNA, the RT products are then subject to PCR using labeled primers. 
Alternatively, rather than labeling the primers, a labeled dNTP can be included 
in the PCR reaction mixture. PCR amplification can be performed in a DNA 
thermal cycler according to conventional techniques. After a suitable number of 
rounds to achieve amplification, the PCR reaction mixture is electrophoresed on 
a polyacrylamide gel. After drying the gel, the radioactivity of the appropriate 
bands (corresponding to the mRNA encoding the Streptococcus polypeptides)) 
is quantified using an imaging analyzer. RT and PCR reaction ingredients and 
conditions, reagent and gel concentrations, and labeling methods are well 
known in the art. Variations on the RT-PCR method will be apparent to the 
skilled artisan. 

Assaying Streptococcus polypeptide levels in a biological sample can 
occur using any art-known method. Preferred for assaying Streptococcus 
polypeptide levels in a biological sample are antibody-based techniques. For 
example. Streptococcus polypeptide expression in tissues can be studied with 
classical immunohistological methods. In these, the specific recognition is 
provided by the primary antibody (polyclonal or monoclonal) but the secondary 
detection system can utilize fluorescent, enzyme, or other conjugated secondary 
antibodies. As a result, an immunohistological staining of tissue section for 
pathological examination is obtained. Tissues can also be extracted, e.g with 
urea and neutral detergent, for the liberation of Streptococcus polypeptides for 
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Western-blot or dot/slot assay (Jalkanen, M., et at., J. Cell. Biol. 707:976-985 
(1985); Jalkanen, M„ et al., J. Cell . Biol. 705:3087-3096 (1987)). In this 
technique, which is based on the use of cationic solid phases, quantitation of a 
Streptococcus polypeptide can be accomplished using an isolated Streptococcus 
polypeptide as a standard. This technique can also be applied to body fluids. 

Other antibody-based methods useful for detecting Streptococcus 
polypeptide gene expression include immunoassays, such as the enzyme linked 
immunosorbent assay (ELISA) and the radioimmunoassay (RIA). For 
example, a Streptococcus polypeptide-specific monoclonal antibodies can be 
used both as an immunoabsorbent and as an enzyme-labeled probe to detect and 
quantify a Streptococcus polypeptide. The amount of a Streptococcus 
polypeptide present in the sample can be calculated by reference to the amount 
present in a standard preparation using a linear regression computer algorithm. 
Such an ELISA for detecting a tumor antigen is described in Iacobelli et al.. 
Breast Cancer Research and Treatment 77:19-30 (1988). In another ELISA 
assay, two distinct specific monoclonal antibodies can be used to detect 
Streptococcus polypeptides in a body fluid. In this assay, one of the antibodies 
is used as the immunoabsorbent and the other as the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or 
"two-step" assay. The "one-step" assay involves contacting the Streptococcus 
polypeptide with immobilized antibody and, without washing, contacting the 
mixture with the labeled antibody. The "two-step" assay involves washing 
before contacting the mixture with the labeled antibody. Other conventional 
methods may also be employed as suitable. It is usually desirable to immobilize 
one component of the assay system on a support, thereby allowing other 
components of the system to be brought into contact with the component and 
readily removed from the sample. 

Streptococcus polypeptide-specific antibodies for use in the present 
invention can be raised against an intact S. pneumoize polypeptide of the present 
invention or fragment thereof. These polypeptides and fragments may be 
administered to an animal (e.g., rabbit or mouse) either with a carrier protein 
(e.g., albumin) or, if long enough (e.g., at least about 25 amino acids), without 
a carrier. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" 
(Mab) is meant to include intact molecules as well as antibody fragments (such 
as, for example. Fab and Ffab'^ fragments) which are capable of specifically 
binding to a Streptococcus polypeptide. Fab and F(ab')^ fragments lack the Fc 
fragment of intact antibody, clear more rapidly from the circulation, and may 
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have less non-specific tissue binding of an intact antibody (Wahl et al, J. Nucl. 
Med. 24: 316-325 (1983)). Thus, these fragments are preferred. 

The antibodies of the present invention may be prepared by any of a 
variety of methods. For example, the S. pneumoniae polypeptides identified in 
Table 1, or fragments thereof, can be administered to an animal in order to 
induce the production of sera containing polyclonal antibodies. In a preferred 
method, a preparation of a S. pneumoniae polypeptide of the present invention 
is prepared and purified to render it substantially free of natural contaminants. 
Such a preparation is then introduced into an animal in order to produce 
polyclonal antisera of high specific activity. 

In the most preferred method, the antibodies of the present invention are 
monoclonal antibodies. Such monoclonal antibodies can be prepared using 
hybridoma technology (Kohler et al., Nature 256: 495 (1975); Kohler et al., 
Eur. J. Immunol. 6:511 (1976); Kohler et al, Eur. J. Immunol. 6: 292 (1976); 
Hammerling et al. In: Monoclonal Antibodies and T-Cell Hybridomas, 
Elsevier, N.Y., (1981) pp. 563-681 ). In general, such procedures involve 
immunizing an animal (preferably a mouse) with a S. pneumoniae polypeptide 
antigen of the present invention. Suitable cells can be recognized by their 
capacity to bind anti -Streptococcus polypeptide antibody. Such cells may be 
cultured in any suitable tissue culture medium; however, it is preferable to 
culture cells in Earle's modified Eagle’s medium supplemented with 10% fetal 
bovine serum (inactivated at about 56°C), and supplemented with about 10 g/1 
of nonessential amino acids, about 1,000 U/ml of penicillin, and about 100 
pg/ml of streptomycin. The splenocytes of such mice are extracted and fused 
with a suitable myeloma cell line. Any suitable myeloma cell line may be 
employed in accordance with the present invention; however, it is preferable to 
employ the parent myeloma cell line (SP^O), available from the American Type 
Culture Collection, Rockville, Maryland. After fusion, the resulting hybridoma 
cells are selectively maintained in HAT medium, and then cloned by limiting 
dilution as described by Wands et al. ( Gastroenterology 80: 225-232 (1981)). 
The hybridoma cells obtained through such a selection are then assayed to 
identify clones which secrete antibodies capable of binding the Streptococcus 
polypeptide antigen administered to immunized animal. 

Alternatively, additional antibodies capable of binding to Streptococcus 
polypeptide antigens may be produced in a two-step procedure through the use 
of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies 
are themselves antigens, and that, therefore, it is possible to obtain an antibody 
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which binds to a second antibody. In accordance with this method. 

Streptococcus polypeptide-specific antibodies are used to immunize an animal, 

preferably a mouse. The splenocytes of such an animal are then used to 

produce hybridoma cells, and the hybridoma cells are screened to identify 

5 clones which produce an antibody whose ability to bind to the Streptococcus 

polypeptide-specific antibody can be blocked by a Streptococcus polypeptide 

antigen. Such antibodies comprise anti-idiotypic antibodies to the Streptococcus 

polypeptide-specific antibody and can be used to immunize an animal to induce 

formation of further Streptococcus polypeptide-specific antibodies. 

10 It will be appreciated that Fab and F(ab') 2 and other fragments of the 

antibodies of the present invention may be used according to the methods 
disclosed herein. Such fragments are typically produced by proteolytic 
cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin 
(to produce F(ab'), fragments). Alternatively, Streptococcus 

15 polypeptide-binding fragments can be produced through the application of 

recombinant DNA technology or through synthetic chemistry. 

Of special interest to the present invention are antibodies to 
Streptococcus polypeptide antigens which are produced in humans, or are 
"humanized” (i.e., non-immunogenic in a human) by recombinant or other 
20 technology. Humanized antibodies may be produced, for example by replacing 

an immunogenic portion of an antibody with a corresponding, but non- 
immunogenic portion (i.e., chimeric antibodies) (Robinson, R.R. et al.. 
International Patent Publication PCT/US86/02269; Akira, K. et al., European 
Patent Application 184,187; Taniguchi, M., European Patent Application 
25 171,496; Morrison, S.L. et al., European Patent Application 173,494; 

Neuberger, M.S. et al., PCT Application WO 86/01533; Cabilly, S. et al., 
European Patent Application 125,023; Better, M. et al.. Science 
240: 1041-1043 (1988); Liu, A.Y. et al., Proc. Natl. Acad. Sci. USA 
54:3439-3443 (1987); Liu, A.Y. et al., J. Immunol. 139: 3521-3526 (1987); 
30 Sun, L.K. etal., Proc. Natl. Acad. Sci. USA 84: 214-218 (1987); Nishimura, 

Y. etal.. Cane. Res. 47: 999-1005 (1987); Wood, C.R. etal.. Nature 
3/4:446-449 (1985)); Shaw et al., J. Natl. Cancer Inst. 50:1553-1559 (1988). 
General reviews of "humanized" chimeric antibodies are provided by Morrison, 
S.L. ( Science , 229:1202-1207 (1985)) and by Oi, V.T. etal., BioTechniques 
35 4:214 (1986)). Suitable "humanized" antibodies can be alternatively produced 

by CDR or CEA substitution (Jones, P.T. et al., Nature 321: 552-525 (1986); 
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Verhoeyan et al, Science 239: 1534 (1988); Beidler, C.B. et al, J. Immunol. 
747:4053-4060 (1988)). 

Suitable enzyme labels include, for example, those from the oxidase 
group, which catalyze the production of hydrogen peroxide by reacting with 
substrate. Glucose oxidase is particularly preferred as it has good stability and 
its substrate (glucose) is readily available. Activity of an oxidase label may be 
assayed by measuring the concentration of hydrogen peroxide formed by the 
enzyme-labeled antibody/substrate reaction. Besides enzymes, other suitable 
labels include radioisotopes, such as iodine (' 25 I, 12 'l), carbon (“c), sulphur 
( S), tritium ( H), indium (“in), and technetium (" m Tc), and fluorescent 
labels, such as fluorescein and rhodamine, and biotin. 


Further suitable labels for the Streptococcus polypeptide-specific 
antibodies of the present invention are provided below. Examples of suitable 
enzyme labels include malate dehydrogenase, staphylococcal nuclease, 
delta-5-steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol 
phosphate dehydrogenase, triose phosphate isomerase, peroxidase, alkaline 
phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, 


urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and 
acetylcholine esterase. 

Examples of suitable radioisotopic labels include 3 H, "'in, ' 25 I, l3 'l, 

32 P 'V 51 rv 57 X~ 5 Vv, 5 V 75 o 15 V 90 67^ 217 . 211 212 

B, b, C, Cr, To, Co, Fe, Se, Eu, Y, Cu, Ct, At, Pb, 

47 1 <» 111 

be, Pd, etc. In is a preferred isotope where in vivo imaging is used since 
its avoids the problem of dehalogenation of the I or I-labeled monoclonal 
antibody by the liver. In addition, this radionucleotide has a more favorable 
gamma emission energy for imaging (Perkins et al, Eur. J. Nucl. Med. 
70:296-301 (1985); Carasquillo et al., J. Nucl. Med. 28: 281-287 (1987)). For 
example. In coupled to monoclonal antibodies with 

1 -(P-isothiocyanatobenzyl)-DPTA has shown little uptake in non-tumorous 
tissues, particularly the liver, and therefore enhances specificity of tumor 
localization (Esteban et al, J. Nucl. Med. 25:861-870 (1987)). 

Examples of suitable non-radioactive isotopic labels include ' 5? Gd 

55 » „ l62 ^ 56 ' ’ 

Mn, Dy, Tr, and Fe. 

Examples of suitable fluorescent labels include an IS *Eu label, a 
fluorescein label, an isothiocyanate label, a rhodamine label, a phycoerythrin 
label, a phycocyanin label, an allophycocyanin label, an o-phthaldehyde label, 
and a fluorescamine label. 


Examples of suitable toxin labels include diphtheria toxin, ricin, and 
cholera toxin. 
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Examples of chemiluminescent labels include a luminal label, an 
isoluminal label, an aromatic acridinium ester label, an imidazole label, an 
acridinium salt label, an oxalate ester label, a luciferin label, a luciferase label, 
and an aequorin label. 

Examples of nuclear magnetic resonance contrasting agents include 
heavy metal nuclei such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies 
are provided by Kennedy et al, Clin. Chim. Acta 70: 1-31 (1976), and Schurs 
et al., Clin. Chim. Acta 57:1-40 (1977). Coupling techniques mentioned in the 
latter are the glutaraldehyde method, the periodate method, the dimaleimide 
method, the m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of 
which methods are incorporated by reference herein. 

In a related aspect, the invention includes a diagnostic kit for use in 
screening serum containing antibodies specific against 5. pneumoniae 
infection. Such a kit may include an isolated 5. pneumoniae antigen comprising 
an epitope which is specifically immunoreactive with at least one anti-5. 
pneumoniae antibody. Such a kit also includes means for detecting the binding 
of said antibody to the antigen. In specific embodiments, the kit may include a 
recombinantly produced or chemically synthesized peptide or polypeptide 
antigen. The peptide or polypeptide antigen may be attached to a solid support. 

In a more specific embodiment, the detecting means of the above- 
described kit includes a solid support to which said peptide or polypeptide 
antigen is attached. Such a kit may also include a non-attached reporter-labelled 
anti-human antibody. In this embodiment, binding of the antibody to the 5. 
pneumoniae antigen can be detected by binding of the reporter labelled antibody 
to the anti-5, pneumoniae antibody. 

In a related aspect, the invention includes a method of detecting 5. 
pneumoniae infection in a ‘subject. This detection method includes reacting a 
body fluid, preferrably serum, from the subject with an isolated 5. pneumoniae 
antigen, and examining the antigen for the presence of bound antibody. In a 
specific embodiment, the method includes a polypeptide antigen attached to a 
solid support, and serum is reacted with the support. Subsequently, the support 
is reacted with a reporter-labelled anti-human antibody. The support is then 
examined for the presence of reporter-labelled antibody. 

The solid surface reagent employed in the above assays and kits is 
prepared by known techniques for attaching protein material to solid support 
material, such as polymeric beads, dip sticks, 96-well plates or filter material. 
These attachment methods generally include non-specific adsorption of the 
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protein to the support or covalent attachment of the protein , typically through a 
free amine group, to a chemically reactive group on the solid support, such as 
an activated carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin 
coated plates can be used in conjunction with biotinylated antigen(s). 

Therapeutics and Modes of Administration 

The present invention also provides vaccines comprising one or more 
polypeptides of the present invention. Heterogeneity in the composition of a 
vaccine may be provided by combining S. pneumoniae polypeptides of the 
present invention. Multi-component vaccines of this type are desirable because 
they are likely to be more effective in eliciting protective immune responses 
against multiple species and strains of the Streptococcus genus than single 
polypeptide vaccines. Thus, as discussed in detail below, a multi-component 
vaccine of the present invention may contain one or more, preferably 2 to about 
20, more preferably 2 to about 15, and most preferably 3 to about 8, of the 5. 
pneumoniae polypeptides identified in Table 1, or fragments thereof. 

Multi-component vaccines are known in the art to elicit antibody 
production to numerous immunogenic components. Decker, M. and Edwards, 
K., J. Infect. Dis. 774:S270-275 (1996). In addition, a hepatitis B, diphtheria, 
tetanus, pertussis tetravalent vaccine has recently been demonstrated to elicit 
protective levels of antibodies in human infants against all four pathogenic 
agents. Aristegui, J. et al.. Vaccine 75:7-9 (1997). 

The present invention thus also includes multi-component vaccines. 
These vaccines comprise more than one polypeptide, immunogen or antigen. 
An example of such a multi-component vaccine would be a vaccine comprising 
more than one of the S. pneumoniae polypeptides described in Table 1. A 
second example is a vaccine comprising one or more, for example 2 to 10, of 
the S. pneumoniae polypeptides identified in Table 1 and one or more, for 
example 2 to 10, additional polypeptides of either streptococcal or 
non-streptococcal origin. Thus, a multi-component vaccine which confers 
protective immunity to both a Streptococcal infection and infection by another 
pathogenic agent is also within the scope of the invention. 

As indicated above, the vaccines of the present invention are expected to 
elicit a protective immune response against infections caused by species and 
strains of Streptococcus other than strain ofS. pneumoniae deposited with that 
ATCC. 

Further within the scope of the invention are whole cell and whole viral 
vaccines. Such vaccines may be produced recombinantly and involve the 
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expression of one or more of the S. pneumoniae polypeptides described in 
Table 1. For example, the S’, pneumoniae polypeptides of the present invention 
may be either secreted or localized intracellular, on the cell surface, or in the 
penplasmic space. Further, when a recombinant virus is used, the S 
pneumoniae polypeptides of the present invention may, for example, be 
localized in the viral envelope, on the surface of the capsid, or internally within 
the capsid. Whole cells vaccines which employ cells expressing heterologous 
proteins are known in the art. See, e.g., Robinson, K. et al. Nature Biotech. 
15. 653-657 (1997); Sirard, J. et al.. Infect. Immun. <55:2029-2033 (1997); 
Chabalgoity, J. et al.. Infect. Immun. 65:2402-2412 (1997). These cells may 
be administered live or may be killed prior to administration. Chabalgoity, J. et 
al., supra, for example, report the successful use in mice of a live attenuated 
Salmonella vaccine strain which expresses a portion of a platyhelminth fatty 
acid-binding protein as a fusion protein on its cells surface. 

A multi-component vaccine can also be prepared using techniques 
known in the art by combining one or more S. pneumoniae polypeptides of the 
present invention, or fragments thereof, with additional non-streptococcal 
components (e.g., diphtheria toxin or tetanus toxin, and/or other compounds 
known to elicit an immune response). Such vaccines are useful for eliciting 
protective immune responses to both members of the Streptococcus genus and 
non-streptococcal pathogenic agents. 

The vaccines of the present invention also include DNA vaccines. DNA 
vaccines are currently being developed for a number of infectious diseases. 
Boyer, J et al., Nat. Med. 5:526-532 (1997); reviewed in Spier, R., Vaccine 
14. 1285-1288 (1996). Such DNA vaccines contain a nucleotide sequence 
encoding one or more S. pneumoniae polypeptides of the present invention 
oriented in a manner that allows for expression of the subject polypeptide. The 
direct administration of plasmid DNA encoding B. burgdorgeri OspA has been 
shown to elicit protective immunity in mice against borrelial challenge. Luke, 

C. et al., J. Infect. Dis. 175:91-91 (1997). 

The present invention also relates to the administration of a vaccine 
which is co-administered with a molecule capable of modulating immune 
responses. Kim, J. et al.. Nature Biotech. 75:641-646 (1997), for example, 
report the enhancement of immune responses produced by DNA immunizations 
when DNA sequences encoding molecules which stimulate the immune 
response are co-administered. In a similar fashion, the vaccines of the present 
invention may be co-administered with either nucleic acids encoding immune 
modulators or the immune modulators themselves. These immune modulators 
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include granulocyte macrophage colony stimulating factor (GM-CSF) and 
CD86. 

The vaccines of the present invention may be used to confer resistance to 
streptococcal infection by either passive or active immunization. When the 
vaccines of the present invention are used to confer resistance to streptococcal 
infection through active immunization, a vaccine of the present invention is 
administered to an animal to elicit a protective immune response which either 
prevents or attenuates a streptococcal infection. When the vaccines of the 
present invention are used to confer resistance to streptococcal infection through 
passive immunization, the vaccine is provided to a host animal ( e.g ., human, 
dog, or mouse), and the antisera elicited by this antisera is recovered and 
directly provided to a recipient suspected of having an infection caused by a 
member of the Streptococcus genus. 

The ability to label antibodies,' or fragments of antibodies, with toxin 
molecules provides an additional method for treating streptococcal infections 
when passive immunization is conducted. In this embodiment, antibodies, or 
fragments of antibodies, capable of recognizing the 5. pneumoniae polypeptides 
disclosed herein, or fragments thereof, as well as other Streptococcus proteins, 
are labeled with toxin molecules prior to their administration to the patient. 
When such toxin derivatized antibodies bind to Streptococcus cells, toxin 
moieties will be localized to these cells and will cause their death. 

The present invention thus concerns and provides a means for 
preventing or attenuating a streptococcal infection resulting from organisms 
which have antigens that are recognized and bound by antisera produced in 
response to the polypeptides of the present invention. As used herein, a vaccine 
is said to prevent or attenuate a disease if its administration to an animal results 
either in the total or partial attenuation (i.e., suppression) of a symptom or 
condition of the disease, or in the total or partial immunity of the animal to the 

disease. 

The administration of the vaccine (or the antisera which it elicits) may be 
for either a "prophylactic” or "therapeutic" purpose. When provided 
prophylactically, the compound(s) are provided in advance of any symptoms of 
streptococcal infection. The prophylactic administration of the compound(s) 
serves to prevent or attenuate any subsequent infection. When provided 
therapeutically, the compound(s) is provided upon or after the detection of 
symptoms which indicate that an animal may be infected with a member of the 
Streptococcus genus. The therapeutic administration of the compound(s) serves 
to attenuate any actual infection. Thus, the S. pneumoniae polypeptides, and 
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fragments thereof, of the present invention may be provided either prior to the 

onset of mfectton (so as to prevent or attenuate an anticipated infection) or after 

tne initiation of an actual infection. 

The polypeptides of the invention, whether encoding a portion of a 
native protein or a functional derivative thereof, may be administered in pure 
orm or may be coupled to a macromolecular carrier. Example of such carriers 
are proteins and carbohydrates. Suitable proteins which may act as 
macromolecular carrier for enhancing the immunogenicity of the polypeptides of 
t e present invention include keyhole limpet hemacyanin (KLH) tetanus toxoid 
pertussis toxin, bovine serum albumin, and ovalbumin. Methods for coupline 
the polypeptides of the present invention to such macromolecular carriers are 
isclosed in Harlow et aL, Antibodies: A Laboratory Manual, 2nd Ed • Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1988) the 
entire disclosure of which is incorporated by reference herein. 

A composition is said to be "pharmacologically acceptable" if its 
a ministration pan be tolerated by a recipient animal and is otherwise suitable for 
administration to that animal. Such an agent is said to be administered in a 
t erapeutically effective amount" if the amount administered is physiologically 
significant. An agent is physiologically significant if its presence results in a 
etectable change in the physiology of a recipient patient. 

While m all instances the vaccine of the present invention is administered 
as a pharmacologically acceptable compound, one skilled in the an would 
recognize that the composition of a pharmacologically acceptable compound 
vanes with the animal to which it is administered. For example, a vaccine 
intended for human use will generally not be co-administered with Freund’s 
j ant. Further, the level of purity of the S. pneumoniae polypeptides of the 
present invention will normally be higher when administered to a human than 
when administered to a non-human animal. 

As would be understood by one of ordinary skill in the art, when the 
vaccine of the present invention is provided to an animal, i, may be i„ „ 
compos, non which may contain salts, buffers, adjuvants, or other substances 
which are desirable for improving the efficacy of the composition. Adjuvants 
are substances dial can be used to specifically augment a specific immune 
response. These substances generally perform two functions: (I) they protect 
■he antigen(s) from being rapidly catabolized after administration and (2) they 
nonspecifically stimulate immune responses 

Normally, the adjuvant and the composition are mixed prior to 
presentauon to the immune system, or presented separately, but into the same 
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site of the animal being immunized. Adjuvants can be loosely divided into 
several -roups based upon their composition. These groups include oil 
adjuvants (for example, Freund's complete and incomplete), mineral salts (for 
example, A1K(SC> 4 ) 2 , AlNa(S0 4 ) 2 , A1NH(S0 4 ), silica, kaolin, and carbon), 
polynucleotides (for example, poly IC and poly AU acids), and certain natural 
substances (for example, wax D from Mycobacterium tuberculosis , as well as 
substances found in Corynebacterium parvum, or Bordetella pertussis, and 
members of the genus Brucella. Other substances useful as adjuvants are the 
saponins such as, for example, Quil A. (Superfos A/S, Denmark). Preferred 
adjuvants for use in the present invention include aluminum salts, such as 
AlK(SO ) , AlNa(S0 4 ),, and A1NH 4 (S0 4 ). Examples of materials suitable for 

use in vaccine compositions are provided in Remington’s Pharmaceutical 
Sciences (Osol, A, Ed, Mack Publishing Co, Easton, PA, pp. 1324-1341 
(1980), which reference is incorporated herein by reference). 

The therapeutic compositions of the present invention can be 
administered fjarenterally by injection, rapid infusion, nasopharyngeal 
absorption (intranasopharangeally), dermoabsorption, or orally. The 
compositions may alternatively be administered intramuscularly, or 
intravenously. Compositions for parenteral administration include stenle 
aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of 
non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oi s 
such as olive oil, and injectable organic esters such as ethyl oleate. Carriers or 
occlusive dressings can be used to increase skin permeability and enh 
antigen absorption. Liquid dosage forms for oral administration may generally 
comprise a liposome solution containing the liquid dosage form. Suitable forms 
for suspending liposomes include emulsions, suspensions, solutions, syrups, 
and elixirs containing inert diluents commonly used in the art, such as purified 
water. Besides the inert diluents, such compositions can also include adjuvants, 
wetting agents, emulsifying and suspending agents, or sweetening, flavoring, 

or perfuming agents. 

Therapeutic compositions of the present invention can also 
administered in encapsulated form. For example, intranasal immunization of 
mice against Bordetella pertussis infection using vaccines encapsulate in 
biodegradable microsphere composed of poly(DL-lactide-co-glycolide) has been 
shown to stimulate protective immune responses. Shahin, R. et al„ Infect . 
Immun 1195-1200 (1995). Similarly, orally administered encapsulated 
Salmonella typhimurium antigens have also been shown to elicit protective 
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immunity in mice. Allaoui-Attarki, K. et al, Infect. Immun. 65: 853-857 
(1997). Encapsulated vaccines of the present invention can be administered by 
a variety of routes including those involving contacting the vaccine with mucous 
membranes (e.g., intranasally, intracolonicly, intraduodenally). 

Many different techniques exist for the timing of the immunizations 
when a multiple administration regimen is utilized. It is possible to use the 
compositions of the invention more than once to increase the levels and 
diversities of expression of the immunoglobulin repertoire expressed by the 
immunized animal. Typically, if multiple immunizations are given, they will be 
given one to two months apart. 

According to the present invention, an "effective amount" of a 
therapeutic composition is one which is sufficient to achieve a desired biological 
effect. Generally, the dosage needed to provide an effective amount of the 
composition will vary depending upon such factors as the animal's or human's 
age, condition, sex, and extent of disease, if any, and other variables which can 
be adjusted by pne of ordinary skill in the art. 

The antigenic preparations of the invention can be administered by either 
single or multiple dosages of an effective amount. Effective amounts of the 
compositions of the invention can vary from 0.01-1,000 pg/ml per dose, more 
preferably 0. 1-500 pg/ml per dose, and most preferably 10-300 pg/ml per dose. 

Having now generally described the invention, the same will be more 
readily understood through reference to the following example which is 

provided by way of illustration, and is not intended to be limiting of the present 
invention, unless specified. 

Examples 

Example 1: Expression and Purification of S.- pneumoniae 

Polypeptides in E. coli 

The bacterial expression vector pQElO (QIAGEN, Inc., 9259 Eton 
Avenue, Chatsworth, CA, 91311) is used in this example for cloning of the 
nucleotide sequences shown in Table 1 and for expressing the polypeptides 
identified in Table 1. The components of the pQElO plasmid are arranged such 
that the inserted DNA sequence encoding a polypeptide of the present invention 
expresses the polypeptide with the six His residues ( i.e ., a "6 X His tag")) 
covalently linked to the amino terminus. 

The DNA sequences encoding the desired portions of the polypeptides 
of Table 1 are amplified using PCR oligonucleotide primers from either a DNA 
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library constructed from 5. pnuemonicae, such as the one deposited by the 
inventors at the ATCC for convenience, AT CC Deposit No. 97755, or from 
DNA isolated from the same organism such as the S. pneumoniae strain 
deposited with the ATCC as Deposit No. 55840. A list of PCR primers which 
can be used for this purpose is provided in Table 3, below. The PCR primers 
anneal to the nucleotide sequences encoding both the ammo terminal and 
carboxy terminal amino acid sequences of the desired portion of the 
polypeptides of Table 1. Additional nucleotides containing restriction sites to 
facilitate cloning in the pQElO vector were added to the 5' and 3' primer 
sequences, respectively. Such restriction sites are listed in Table 3 for each 
primer. In each case, the primer comprises, from the 5’ end, 4 random 
nucleotides to prevent "breathing" during the annealing process, a restncuon site 
(shown in Table 3), and approximately 15 nucleotides of S. pneumoniae ORF 
sequence (the complete sequence of each cloning primer is shown as SEQ ID 
NO:227 through SEQ ID NO:452). 

For cloning the polypeptides of Table 1, the 5' and 3’ primers were 
selected to amplify their respective nucleotide coding sequences. One of 
ordinary skill in the art would appreciate that the point in the protein coding 
sequence where the 5’ primer begins may be varied to amplify a DNA segment 
encoding any desired portion of the complete amino acid sequences described in 
Table 1. Similarly, one of ordinary skill in the art would further appreciate that 
the point in the protein coding sequence where the 3' primer begins may also be 
varied to amplify a DNA segment encoding any desired portion of the complete 

amino acid sequences described in Table 1 . 

The amplified DNA fragment and the pQElO vector are digested with the 

appropriate restriction enzyme(s) and the digested DNAs are then ligated 
together. The ligation mixture is transformed into competent E. coh cells using 
standard procedures such as those described in Sambrook er ai. Molecular 
Cloning: a Laboratory Manual, 2nd Ed.: Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y. (1989). Transformants are identified by their ability 
to «row under selective pressure on LB plates. Plasmid DNA is isolated from 
resistant colonies and the identity of the cloned DNA confirmed by restriction 

analysis, PCR and DNA sequencing. 

Clones containing the desired constructs are grown overnight ( O/N ) in 
liquid culture under selection. The O/N culture is used to inoculate a large 
culture, at a dilution of approximately 1:25 to 1:250. The cells are grown to an 
optical density at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-b-D- 
thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM 
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to induce transcription from the lac repressor sensitive promoter, by inactivating 
the lacl repressor. Cells subsequently are incubated further for 3 to 4 hours. 
Cells are then harvested by centrifugation. 

The cells are stirred for 3-4 hours at 4 C in 6M guanidine-HCl, pH 8. 
The cell debris is removed by centrifugation, and the supernatant containing the 
protein of interest is loaded onto a nickel-nitrilo-tri-acetic acid ("NiNTA") 
affinity resin column (available from QIAGEN, Inc., supra). Proteins with a 
6x His tag bind to the NI-NTA resin with high affinity and can be purified in a 
simple one-step procedure (for details see: The QIAexpressionist, 1995, 
QIAGEN, Inc., supra). Briefly, the supernatant is loaded onto the column in 6 
M guanidine-HCl, pH8, the column is first washed with 10 volumes of 6 M 
guanidine-HCl, pH8, then washed with 10 volumes of 6 M guanidine-HCl 
pH6, and finally the polypeptide is eluted with 6 M guanidine-HCl, pH 5.0. 

The purified protein is then renatured by dialyzing it against phosphate- 
buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. 
Alternatively, the protein can be successfully refolded while immobilized on the 
Ni-NTA column. The recommended conditions are as follows: renature using 
a linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl 
pH7.4, containing protease inhibitors. The renaturation should be performed 
over a period of 1.5 hours or more. After renaturation the proteins can be eluted 
by the addition of 250 mM imidazole. Imidazole is removed by a final dialyzing 
step against PBS or 50 mM sodium acetate pH6 buffer plus 200 mM NaCl. 
The purified protein is stored at 4°C or frozen at -80°C. 

The DNA sequences encoding the amino acid sequences of Table 1 may 
also be cloned and expressed as fusion proteins by a protocol similar to that 
described directly above, wherein the pET-32b(+) vector (Novagen, 601 
Science Drive, Madison, WI 537 1 1) is preferentially used in place of pQElO. 

Each of the polynucleotides shown in Table 1, was successfully 
amplified and subcloned into pQElO as described above using the PCR primers 
shown in Table 3. These pQElO plasmids containing the DNAs of Table 1, 
except SP023, SP042, SP054, SP063, SP081, SP092, SP114, SP122, 
SP123, SP126, and SP127, were deposited with the ATCC as a pooled deposit 
as a convenience to those of skill in the art. This pooled deposit was desposited 
on October 16, 1997 and given ATCC Deposit No. 209369. Those of ordinary 
skill in the art appreciate that isolating an individual plasmid from the pooled 
deposit is trivial provided the information and reagents described herein. Each 
of the deposited clones is capable of expressing its encoded S. pneumoniae 
polypeptide. 
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Example 2: Immunization and Detection of Immune Responses 
Methods 

Growth of bacterial innoculum, immunization of Mice and 
Challenge with S pneumoniae. 

Propagation and storage of, and challenge by S. pneumoniae are 
preformed essentially as described in Aaberge, I.S. et al.. Virulence of 
Streptococcus pneumoniae in mice: a standardized method for preparation and 
frozen storage of the experimental bacterial inoculum. Microbial Pathogenesis , 

18:141 (1995), incorporated herein by reference. 

Briefly, Todd Hewitt (TH) broth (Difco laboratories, Detroit, MI) with 
17% FCS, and horse blood agar plates are used for culturing the bacteria. Both 
broth and blood plates are incubated at 37°C in a 5% CO, atmosphere. Blood 
plates are incubated for 18 hr. The culture broth is regularly 10-fold serially 
diluted in TH broth kept at room temperature and bacterial suspensions are kept 

at room temperature until challenge of mice. 

For active immunizations C3H/HeJ mice (The Jackson Laboratory, Bar 
Harbor, ME) are injected intraperitoneally (i.p.) at week 0 with 20 g of 
recombinant streptococcal protein, or phosphate-buffered saline (PBS), 
emulsified with complete Freund’s adjuvant (CFA), given a similar booster 
immunization in incomplete Freund’s adjuvant (IF A) at week 4, and challenged 
at week 6. For challenge S. pneumoniae are diluted in TH broth from 
exponentially-growing cultures and mice are injected subcutaneously (s.c.) at 
the base of the tail with 0. 1 ml of these dilutions (serial dilutions are used to find 
medium infectious dose). Streptococci used for challenge are passaged fewer 
than six times in vitro. To assess infection, blood samples are obtained from 
the distal part of the lateral femoral vein into heparinized capillary tubes. A 25 
ul blood sample is serially 10-fold diluted in TH broth, and 25 ul of diluted and 
undiluted blood is plated onto blood agar plates. The plates are incubated for 18 
hr. and colonies are counted. 

Other methods are known in the art, for example, see Langermann, S. et 
al., J. Exp. Med., 180:2277 (1994), incorporated herein by reference. 
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Immunoassays 

Several immunoassay formats are used to quantify levels of 
streptococcal-specific antibodies (ELISA and immunoblot), and to evaluate the 
functional properties of these antibodies (growth inhibition assay). The ELISA 
and immunoblot assays are also used to detect and quantify antibodies elicited in 
response to streptococcal infection that react with specific streptococcal 
antigens. Where antibodies to certain streptococcal antigens are elicited by 
infection this is taken as evidence that the streptococcal proteins in question are 
expressed in vivo. Absence of infection-derived antibodies (seroconversion) 
following streptococcal challenge is evidence that infection is prevented or 
suppressed. The immunoblot assay is also used to ascertain whether antibodies 
raised against recombinant streptococcal antigens recognize a protein of similar 
size in extracts of whole streptococci. Where the natural protein is of similar, or 
identical, size in the immunoblot assay to the recombinant version of the same 
protein, this is taken as evidence that the recombinant protein is the product of a 
full-length cloqe of the respective gene. 

Enzyme-Linked Immunosorbant Assay (ELISA). 

The ELISA is used to quantify levels of antibodies reactive with streptococcus 
antigens elicited in response to immunization with these streptococcal antigens. 
Wells of 96 well microtiter plates (Immunlon 4, Dynatech, Chantilly, Virginia, 
or equivalent) are coated with antigen by incubating 50 1 of 1 g/ml protein 
antigen solution in a suitable buffer, typically 0. 1 M sodium carbonate buffer at 
pH 9.6. After decanting unbound antigen, additional binding sites are blocked 
by incubating 100 1 of 3% nonfat milk in wash buffer (PBS, 0.2% Tween 20, 
7.4). After washing, duplicate serial two-fold dilutions of sera in PBS, 
Tween 20, 1% fetal bovine serum, are incubated for 1 hr, removed, wells are 
washed three times, and incubated with horseradish peroxidase-conjugated goat 
anti-mouse IgG. After three washes, bound antibodies are detected with H 2 O 2 

and 2,2'-azino-di-(3-ethylbenzthiazoline sulfonate) (Schwan, T.G., et al . , Proc. 
Natl. Acad. Sci. USA 92:2909-2913 (1985)) (ABTS®, Kirkegaard & Perry 
Labs., Gaithersburg, MD) and A 4 Q 5 is quantified with a Molecular Devices, 

Corp. (Menlo Park, California) Vmax™ plate reader. IgG levels twice the 
background level in serum from naive mice are assigned the minimum titer of 
1 : 100 . 
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Sodiumdodecylsulfate-Polyacrylamide Gel Electrophoresis 

( SDS-PAGE ) and Immunoblotting 

Using a single well format, total streptococcal protein extracts or 
recombinant streptococcal antigen are boiled in SDS/2-ME sample buffer before 
electrophoresis through 3% acrylamide stacking gels, and resolving gels of 
higher acrylamide concentration, typically 10-15% acrylamide monomer. Gels 
are electro-blotted to nitrocellulose membranes and lanes are probed with 
dilutions of antibody to be tested for reactivity with specific streptococcal 
antigens, followed by the appropriate secondary antibody-enzyme (horseradish 
peroxidase) conjugate. When it is desirable to confirm that the protein had 
transferred following electro-blotting, membranes are stained with Ponceau S. 
Immunoblot signals from bound antibodies are detected on x-ray film as 
chemiluminescence using ECL™ reagents (Amersham Corp., Arlington 
Heights, Illinois). 

/ 

Example 3; Detection of Streptococcus mRNA expression 

Northern blot analysis is carried out using methods described by, among 
others, Sambrook et ah , supra, to detect the expression of the S . pneumoniae 
nucleotide sequences of the present invention in animal tissues. A cDNA probe 
containing an entire nucleotide sequence shown in Table 1 is labeled with 32p 
using the re<i/prime™ DNA labeling system (Amersham Life Science), 
according to manufacturer's instructions. After labeling, the probe is purified 
using a CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), 
according to manufacturer’s protocol number PT 1200-1. The purified labeled 
probe is then used to detect the expression of Streptococcus mRNA in an animal 
tissue sample. 

Animal tissues, such as blood- or spinal fluid, are examined with the 
labeled probe using ExpressHyb™ hybridization solution (Clontech) according 
to manufacturer’s protocol number PT 11 90-1. Following hybridization and 
washing, the blots are mounted and exposed to film at -70 C overnight, and 
films developed according to standard procedures. 
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It will be clear that the invention may be practiced otherwise than as 
particularly described in the foregoing description and examples. 

Numerous modifications and variations of the present invention are 
possible in light of the above teachings and, therefore, are within the scope of 
5 the appended claims. 

The entire disclosure of all publications (including patents, patent 
applications, journal articles, laboratory manuals, books, or other documents) 
cited herein are hereby incorporated by reference. 
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SP001 nucleotide (SEQ ID NO:l) 

TAAAATCTACGACAATAAAAATCAACTCATTGCTGACTTGGGTTCTGAACGCCGCGTCAATGCCCAAGC 

TAATGATATTCCCACAGATTTGGTTAAGGCAATCGTTTCTATCGAAGACCATCGCTTCTTCGACCACAG 

GGGGATTGATACCATCCGTATCCTGGGAGCTTTCTTGCGCAATCTGCAAAGCAATTCCCTCCAAGGTGG 

ATCAACTCTCACCCAACAGTTGATTAAGTTGACTTACTTTTCAACTTCGACTTCCGACCAGACTATTTC 

TCGTAAGGCTCAGGAAGCTTGGTTAGCGATTCAGTTAGAACAAAAAGCAACCAAGCAAGAAATCTTGAC 

CTACTATATAAATAAGGTCTACATGTCTAATGGGAACTATGGAATGCAGACAGCAGCTCAAAACTACTA 

TGGTAAAGACCTCAATAATTTAAGTTTACCTCAGTTAGCCTTGCTGGCTGGAATGCCTCAGGCACCAAA 

CCAATATGACCCCTATTCACATCCAGAAGCAGCCCAAGACCGCCGAAACTTGGTCTTATCTGAAATGAA 

AAATCAAGGCTACATCTCTGCTGAACAGTATGAGAAAGCAGTCAATACACCAATTACTGATGGACTACA 

AAGTCTCAAATCAGCAAGTAATTACCCTGCTTACATGGATAATTACCTCAAGGAAGTCATCAATCAAGT 

TGAAGAAGAAACAGGCTATAACCTACTCACAACTGGGATGGATGTCTACACAAATGTAGACCAAGAAGC 

TCAAAAACATCTGTGGGATATTTACAATACAGACGAATACGTTGCCTATCCAGACGATGAATTGCAAGT 

CGCTTCTACCATTGTTGATGTTTCTAACGGTAAAGTCATTGCCCAGCTAGGAGCACGCCATCAGTCAAG 

TAATGTTTCCTTCGGAATTAACCAAGCAGTAGAAACAAACCGCGACTGGGGATCAACTATGAAACCGAT 

CACAGACTATGCTCCTGCCTTGGAGTACGGTGTCTACGATTCAACTGCTACTATCGTTCACGATGAGCC 

CTATAACTACCCTGGGACAAATACTCCTGTTTATAACTGGGATAGGGGCTACTTTGGCAACATCACCTT 

GCAATACGCCCTGCAACAATCGCGAAACGTCCCAGCCGTGGAAACTCTAAACAAGGTCGGACTCAACCG 

CGCCAAGACTTTCCTAAATGGTCTAGGAATCGACTACCCAAGTATTCACTACTCAAATGCCATTTCAAG 

TAACACAACCGAATCAGACAAAAAATATGGAGCAAGTAGTGAAAAGATGGCTGCTGCTTACGCTGCCTT 

TGCAAATGGTGGAACTTACTATAAACCAATGTATATCCATAAAGTCGTCTTTAGTGATGGGAGTGAAAA 

AGAGTTCTCTAATGTCGGAACTCGTGCCATGAAGGAAACGACAGCCTATATGATGACCGACATGATGAA 

^SgS?gacttatggaactggacgaaatgcctatcttgcttggctccctcaggctggtaaaacagg 

AACCTCTAACTATACAGACGAGGAAATTGAAAACCACATCAAGACCTCTCAATTTGTAGCACCTGATGA 

ACTATTTGCTGGCTATACGCGTAAATATTCAATGGCTGTATGGACAGGCTATTCTAACCGTCTGACACC 

ACTTGTAGGCAATGGCCTTACGGTCGCTGCCAAAGTTTACCGCTCTATGATGACCTACCTGTCTGAAGG 

iAGcLTCCAGAAGATTGGAATATACCAGAGGGGCTCTACAGAAATGGAGAATTCGTATTT.AAAAATGG 

TGCTCGTTCTACGTGGAACTCACCTGCTCCACAACAACCCCCATCAACTGAAAGTTCAAGCTCATCAx 

AGATAGTTCAACTTCACAGTCTAGCTCAACCACTCCAAGCACAAATAATAGTACGACTACCAATCCT 

CAATAATACGCAACAATCAAATACAACCCCTGATCAACAAAATCAGAATCCTCAACCAGCACAACCA 


cpnni AMINO ACID (SEQ ID NO : 2 ) 

kiydnknqliadlgserrvnaqandiptdlvkaivsiedhrffdhrgidtirilgaflrnlqsnslqgg 

stltqqlikltyfststsdqtisrkaqeawlaiqleqkatkqeiltyyinkvymsngnygmqtaawyy 

gkdlnnlslpqlallagmpqapnqydpyshpeaaqdrrnlvlsemknqgyisaeqyekavntpitdglq 

slksasnypaymdnylkevinqveeetgynllttgmdvytnvdqeaqkhlwdiyntdeyvaypddelqv 

S^ng^lgarhq^nvsfginqavetnrdwgstmkpitdyapaleygvydstativhdep 

ynypgtntpvynwdrgyfgnitlqyalqqsrnvpavetlnkvglnraktflnglgidypsxh^snai 

nttesdkkygassekmaaayaafanggtyykpmyihkwfsdgsekefsnvgtramketta/mmtdmmk 

tvltygtgrnaylawlpqagktgtsnytdeeienhiktsqfvapdelfagytrkysmawtgysnrltp 

LVGNGLTVAAKVYRSMMTYLSEGSNPEDWNIPEGLYRNGEFVFKNGARSTWNSPAPQQPPSxESSSS 

DSSTSQSSSTTPSTNNSTTTNPNNNTQQSNTTPDQQNQNPQPAQP 


epooi nucleotide (SEQ ID NO 5 3 ) 

aaattacaatacggactatgaattgacctctggagaaaaattacctcttcctaaagagatttcaggtta 

cacttatattggatatatcaaagagggaaaaacgacttctgagtctgaagtaagtaatcaaaagagttc 

SSSc S cSIcL^caacaaaaggtggattataatgttacaccgaattttg^^ 

aacagtacaagctattcaggaacaaacacctgtttcttcaactaagccgacagaagttcaagtagttga 

aaaacctttctctactgaattaatcaatccaagaaaagaagagaaacaatcttcagattctcaagaa^ 

attagccgaacataagaatctagaaacgaagaaagaggagaagatttctccaaaagaaaagactggggt 

aaatacattaaatccacaggatgaagttttatcaggtcaattgaacaaacctgaactcttatatcgtga 

S^Iactatggagacaaaaatagattttcaagaagaaattcaagaaaatcctgatttagctgaaggaac 

tgtaagagtaaaacaagaaggtaaattaggtaagaaagttgaaatcgtcagaatattctctgtaaacaa 

ggaagaagtttcgcgagaaattgtttcaacttcaacgactgcgcctagtccaagaatagtcgaaaaagg 

SctS^c-cLgttataaaggaacaacctgagactggtgtagaacataaggacgtacagtctgg 

IgS^^a^cccgcaattcagcctgagttgcccgaagctgtagtaagtgacaaaggcgaaccaga 

agScIIc^^acccgaagcagttgtgaccgacaaaggtgagactgaggttcaaccagagtcgcc 

iSS^LrAAOTGATAAAGOTCAACCAGAGCSCGTAGCACCOCTTCCAOAATATWGGGTAATST 
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TGAGCAAGTAAAACCTGAAACTCCGGTTGAGAAGACCAAAGAACAAGGTCCAGAAAAAACTGAAGAAGT 
TCCAGTAAAACCAACAGAAGAAACACCAGTAAATCCAAATGAAGGTACTACAGAAGGAACCTCAATTCA 
AGAAGCAGAAAATCCAGTTCAACCTGCAGAAGAATCAACAACGAATTCAGAGAAAGTATCACCAGATAC 
ATCTAGCAAAAATACTGGGGAAGTGTCCAGTAATCCTAGTGATTCGACAACCTCAGTTGGAGAATCAAA 
T AAAC CAGAAC ATAATGACTCT AAAAATGAAAATTC AGAAAAAACTGT AG AAGAAGTTCC AGTAAATCC 
AAATGAAGGCACAGTAGAAGGTACCTCAAATCAAGAAACAGAAAAACCAGTTCAACCTGCAGAAGAAAC 
ACAAACAAACTCTGGGAAAATAGCTAACGAAAATACTGGAGAAGTATCCAATAAACCTAGTGATTCAAA 
ACCACCAGTTGAAGAATCAAATCAACCAGAAAAAAACGGAACTGCAACAAAACCAGAAAATTCAGGTAA 
TACAACATCAGAGAATGGACAAACAGAACCAGAACCATCAAACGGAAATTCAACTGAGGATGTTTCAAC 
CGAATCAAACACATCCAATTCAAATGGAAACGAAGAAATTAAACAAGAAAATGAACTAGACCCTGATAA 
AAAGGTAGAAGAACCAGAGAAAACACTTGAATTAAGAAATGTTTCCGACCTAGAGTTA 

SP004 amino acid (SEQ ID NO:4) 

NYNTDYELTSGEKLPLPKEISGYTYIGYIKEGKTTSESEVSNQKSSVATPTKQQKVDYNVTPNFVDHPS 

TVQAIQEQTPVSSTKPTEVQWEKPFSTELINPRKEEKQSSDSQEQLAEHKNLETKKEEKISPKEKTGV 

NTLNPQDEVLSGQLNKPELLYREETMETKIDFQEEIQENPDLAEGTVRVKQEGKLGKKVEIVRIFSVNK 

EEVSREIVSTSTTAPSPRIVEKGTKKTQVIKEQPETGVEHKDVQSGAIVEPAIQPELPEAWSDKGEPE 

VQPTLPEAWTDKGETEVQPESPDTWSDKGEPEQVAPLPEYKGNIEQVKPETPVEKTKEQGPEKTEEV 

PVKPTEETPVNPNEGTTEGTSXQEAENPVQPAEESTTNSEKVSPDTSSKNTGEVSSNPSDSTTSVGESN 

KPEHNDSKNENSEKTVEEVPVNPNEGTVEGTSNQETEKPVQPAEETQTNSGKIANENTGEVSNKPSDSK 

PPVEESNQPEKNGTATKPENSGWTTSENGQTEPEPSNGNSTEDVSTESNTSNSNGNEEIKQENELDPDK 

KVEEPEKTLELRNVSDLEL 

SP006 nucleotide (SEQ ID NO: 5) 

TGAGAATCAAGCTACACCCAAAGAGACTAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACGT 

GCCACCATTTGACTACGAAGACAAGGGCAATCTGACAGGCTTTGATATCGAAGTTTTAAAGGCAGTAGA 

TGAAAAACTCAGCGACTACGAGATTCAATTCCAAAGAACCGCCTGGGAGAGCATCTTCCCAGGACTTGA 

TTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCGTGCTGAAAAATACCTTTA 

CTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACAAGAAAAATCCTTTGACTTCTCTTGA 

CCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATAACTGGAA 

TCAGAAACACACTGATAATCCCGCTACAATTAATTTTTCTGGTGAGGATATTGGTAAACGAATCCTAGA 

CCTTGCTAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCG 

TGGTTTAGACCTCTCAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTCAAG 

CGACCAAAAAGAGTTTAAAGAGCAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGA 

AAAACTCAGCAATACCTATCTAGGTGGTTCTTACCTCCCAGATCAATCTCAGTTACAA 

SP00 6 amino acid (SEQ ID NO: 6) 

ENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTAWESIFPGLD 

SGHYQAAANNLSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAQFINNWN 

QKHTDNPATINFSGEDIGKRILDLANGEFDFLVFDKVSVQKIIKDRGLDLSWDLPSADSPSNYIIFSS 

DQKEFKEQFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQ 

SP007 nucleotide (SEQ ID NO: 7) 

TGGTAACCGCTCTTCTCGTAACGCAGCTTCATCTTCTGATGTGAAGACAAAAGCAGCAATCGTCACTGA 

TACTGGTGGTGTTGATGACAAATCATTCAACCAATCAGCTTGGGAAGGTTTGCAGGCTTGGGGTAAAGA 

ACACAATCTTTCAAAAGATAACGGTTTCACTTACTTCCAATCAACAAGTGAAGCTGACTACGCTAACAA 

CTTGCAACAAGCGGCTGGAAGTTACAACCTAATCTTCGGTGTTGGTTTTGCCCTTAATAATGCAGTTAA 

AGATGCAGCAAAAGAACACACTGACTTGAACTATGTCTTGATTGATGATGTGATTAAAGACCAAAAGAA 

TGTTGCGAGCGTAACTTTCGCTGATAATGAGTCAGGTTACCTTGCAGGTGTGGCTGCAGCAAAAACAAC 

TAAGACAAAACAAGTTGGTTTTGTAGGTGGTATCGAATCTGAAGTTATCTCTCGTTTTGAAGCAGGATT 

CAAGGCTGGTGTTGCGTCAGTAGACCCATCTATCAAAGTCCAAGTTGACTACGCTGGTTCATTTGGTGA 

TGCGGCTAAAGGTAAAACAATTGCAGCCGCACAATACGCAGCCGGTGCAGATATTGTTTACCAAGTAGC 

TGGTGGTACAGGTGCAGGTGTCTTTGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCTGAAAATGAAAA 

AGTTTGGGTTATCGGTGTTGATCGTGACCAAGAAGCAGAAGGTAAATACACTTCTAAAGATGGCAAAGA 

ATCAAACTTTGTTCTTGTATCTACTTTGAAACAAGTTGGTACAACTGTAAAAGATATTTCTAACAAGGC 

AGAAAGAGGAGAATTCCCTGGCGGTCAAGTGATCGTTTACTCATTGAAGGATAAAGGGGTTGACTTGGC 

AGTAACAAACCTTTCAGAAGAAGGTAAAAAAGCTGTCGAAGATGCAAAAGCTAAAATCCTTGATGGAAG 

CGTAAAAGTTCCTGAAAAA 
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SP007 amino acid (SEQ XD NO : 8 ) 

GNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDNGFTYFQSTSEADYANN 

LQQAAGSYNLIFGVGFALNNAVKDAAKEHTDLNYVLIDDVIKDQKNVASVTFADNESGYLAGVAAAKTT 

KTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDYAGSFGDAAKGKTIAAAQYAAGADIVYQVA 

GGTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTLKQVGTTVKDISNKA 

ERGEFPGGQVIVYSLKDKGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEK 

SP008 nucleotide (SEQ XD NO : 9 ) 

TGTGGAAATTTGACAGGTAACAGCAAAAAAGCTGCTGATTCAGGTGACAAACCTGTT ATC AAAATG x AC 
CAAATCGGTGACAAACCAGACAACTTGGATGAATTGTTAGCAAATGCCAACAAAATCATTGAAGAAAAA 
GTTGGTGCCAAATTGGATATCCAATACCTTGGCTGGGGTGACTATGGTAAGAAAATGTCAGTTATCACA 
TCATCTGGTGAAAACTATGATATTGCCTTTGCAGATAACTATATTGTAAATGCTCAAAAAGGTGCTTAC 
GCTGACTTGACAGAATTGTACAAAAAAGAAGGTAAAGACCTTTACAAAGCACTTGACCCAGCTTACATC 
AAGGGTAATACTGTAAATGGTAAGATTTACGCTGTTCCAGTTGCAGCCAACGTTGCATCATCTCAAAAC 
TTTGCCTTCAACGGAACTCTCCTTGCTAAATATGGTATCGATATTTCAGGTGTTACTTCTTACGAAACT 

C TTG AGC C AGTC T TG AAAC AAATC AAAG AAAAAG CTCC AG AC GT AGT AC C ATTT GC T ATTGGT AAAGTT 

^TCATCCCATCTGATAATTTTGACTACCCAGTAGCAAACGGTCTTCCATTCGTTATCGACCTTGAAGGC 
GATAC T ACT AAAGTTGT AAAC CGTTACGAAGTGCC TCGTTTC AAAG AAC AC x TGAAGACTCTTCACAAA 
TTCTATGAAGCTGGCTACATTCCAAAAGACGTCGCAACAAGCGATACTTCCTTTGACCTTCAACAAGAT 
ACTTGGTTCGTTCGTGAAGAAACAGTAGGACCAGCTGACTACGGTAACAGCTTGCTTTCACGTGTTGCC 
AACAAAGATATCCAAATCAAACCAATTACTAACTTCATCAAGNAAAACCAAACAACACAAGTTGCTAAC 
TTTGTC ATCTC AAAC AACTCTAAG AAC AAAG AAAAATCAATGG AAATC TTG AAC CTCTTGAATACG AAC 
CCAGAACTCTTGAACGGTCTTGTTTACGGTCCAGAAGGCAAGAACTGGGAAAAAATTGAAGGTAAAGAA 
AACCGTGTTCGCGTTCTTGATGGCTACAAAGGAAACACTCACATGGGTGGATGGAACACTGGTAACAAC 
TGGATCCTTTAC ATC AACGAAAACGTTAC AGACCAACAAATCGAAAATTCTAAGAAAGAAT j. GGCAGAA 
GCTAAAGAATCTCCAGCGCTTGGATTTATCTTCAATACTGACAATGTGAAATCTGAAATCTCAGCTATT 
GCTAACACAATGCAACAATTTGATACAGCTATCAACACTGGTACTGTAGACCCAGATAAAGCGATTCCA 

GAATTGATGG AAAAATTG AAATCTG AAGGTGCCT ACG AAAAAGT ATTGAACG AAATGC AAAAAC AATAC 
GATGAATTCTTGAAAAACAAAAAA 

SP008 amino acid (SEQ ID NO: 10) 

CGNLTGNSKKAADSGDKPVIKMYQIGDKPDNLDEIjLANANKXIEEKVGAKLDIQYLGWGDYGKKMSVIT 
S5GENYDIAFADNYIVNAQKGAYADLTELYKKEGKDLYKALDPAYIKGNTVNGKIYAVPVAANVASSQN 
FAFNGTLLAKYGIDISGVTSYETLEPVLKQIKEKAPDWPFAIGKVFIPSDNFDYPVANGLPFVIDLEG 
DTTKWNRYEVPRFKEHLKTLHKFYEAGYIPKDVATSDTSFDLQQDTWFVREETVGPADYGNSLLSRVA 
NKDIQIKPITNFIKXNQTTQVANFVISNNSKNKEKSMEXLNLLNTNPELLNGLVYGPEGKNWEKXEGKE 
NRVRVLDGYKGNTHMGGWNTGNNWILYINENVTDQQXENSKKELAEAKES PALGF XFNTDNVKSEI SAI 
ANTMQQFDTAINTGTVDPDKAIPELMEKLKSEGAYEKVLNEMQKQYDEFLKNKK 

SP009 nucleotide (SEQ XD NO: 11) 

TGGTC AAGG AACTGC TTC T AAAG AC AAC AAAG AGGGAG AAC T T AAGAAGGTTG AC TTT AT C C T AG ACTG 
GACACCAAATACCAACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGAGT 
GGATGTTGATTTGAAATTGCCACCAGAAGAAAGTTCTTCTGACTTGGTTATCAACGGAAAGGCACCATT 
TGCAGTGTATTTCCAAGACTACATGGCTAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGC 
TATTGTTGAACACAATACATCAGGAATCATCTCTCGTAAATCTGATAATGTAAGCAGTCCAAAAGACTT 

GGTTGGT AAG AAATATGGGAC ATGGAATG AC C C AACTG AAC TTGCT ATGTTG AAAAC CTTGGT AGAATC 

TCAAGGTGGAGACTTTGAGAAGGTTG AAAAAGT ACCAAATAACGACTCAAACTCAATCACACCGATTGC 
CAATGGCGTCTTTGATACTGCTTGGATTTACTACGGTTGGGATGGTATCCTTGCTAAATCTCAAGGTGT 

AG ATGC T AAC TTC ATGT AC TTG AAAG AC T ATGTC AAGGAGTTTG ACT AC T ATTC AC C AGTT ATC ATC GC 

AAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAAGGCTA 

CCAATATGCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACTCAAGGA 

AAAACGTGACTTTGTCATCGAATCTCAAAAATACTTGTCAAAAGAATACGCAAGCGACAAGGAAAAATG 

GGGTCAATTTGACGCAGCTCGCTGGAATGCTTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAAGA 

AGACTTGACAGACAAAGGCTTCACCAACGAATTTGTGAAA 

SP009 amino acid (SEQ XD NO: 12) 
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GQGTASKDNKEAELKKVDFILDWTPNTNHTGLYVAKEKGYFKEAGVDVDLKLPPEESSSDLVINGKAPF 
AVYFQDYMAKKLEKGAGITAVAAIVEHNTSGI ISRKSDNVSSPKDLVGKKYGTWNDPTELAMLKTLVES 
QGGDFEKVEKVPNNDSNS ITPIANGVFDTAWI YYGWDGI LAKSQGVDANFMYLKDYVKEFDYYS PVI I A 
NNDYLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKW 
GQFDAARWNAFYKWDKENGILKEDLTDKGFTNEFVK 

SP010 nucleotide (SEQ ID NO:13) 

TAGCTCAGGTGGAAACGCTGGTTCATCCTCTGGAAAAACAACTGCCAAAGCTCGCACTATCGATGAAAT 

CAAAAAAAGCGGTGAACTGCGAATCGCCGTGTTTGGAGATAAAAAACCGTTTGGCTACGTTGACAATGA 

TGGTTCTACCAAGGTACGCTACGATATTGAACTAGGGAACCAACTAGCTCAAGACCTTGGTGTCAAGGT 

TAAATACATTTCAGTCGATGCTGCCAACCGTGCGGAATACTTGATTTCAAACAAGGTAGATATTACTCT 

TGCTAACTTTACAGTAACTGACGAACGTAAGAAACAAGTTGATTTTGCCCTTCCATATATGAAAGTTTC 

TCTGGGTGTCGTATCACCTAAGACTGGTCTCATTACAGACGTCAAACAACTTGAAGGTAAAACCTTAAT 

TGTCACAAAAGGAACGACTGCTGAGACTTATTTTGAAAAGAATCATCCAGAAATCAAACTCCAAAAATA 

CGACCAATACAGTGACTCTTACCAAGCTCTTCTTGACGGACGTGGAGATGCCTTTTCAACTGACAATAC 

GGAAGTTCTAGCTTGGGCGCTTGAAAATAAAGGATTTGAAGTAGGAATTACTTCCCTCGGTGATCCCGA 

TACCATTGCGGCAGCAGTTCAAAAAGGCAACCAAGAATTGCTAGACTTCATCAATAAAGATATTGAAAA 

ATTAGGCAAGGAAAACTTCTTCCACAAGGCCTATGAAAAGACACTTCACCCAACCTACGGTGACGCTGC 

TAAAGCAGATGACCTGGTTGTTGAAGGTGGAAAAGTTGAT 

SP010 amino acid (SEQ ID NO: 14) 

SSGGNAGSSSGKTTAKARTIDEIKKSGELRIAVFGDKKPFGYVDNDGSTKVRYDIELGNQLAQDLGVKV 

KYISVDAANRAEYLISNKVDITLANFTVTDERKKQVDFALPYMKVSLGWSPKTGLITDVKQLEGKTLI 

VTKGTTAETYFEKNHPEIKLQKYDQYSDSYQALLDGRGDAFSTDNTEVLAWALENKGFEVGITSLGDPD 

TIAAAVQKGNQELLDFINKDIEKLGKENFFHKAYEKTLHPTYGDAAKADDLWEGGKVD 

SP011 nucleotide (SEQ ID NO: 15) 

CTCCAACTATGGTAAATCTGCGGATGGCACAGTGACCATCGAGTATTTCAACCAGAAAAAAGAAATGAC 

CAAAACCTTGGAAGAAATCACTCGTGATTTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAA 

TGTACCAAATGCTGGTGAAGTATTGAAGACACGCGTTCTCGCAGGAGATGTGCCTGATGTGGTCAATAT 

TTACCCACAGTCCATCGAACTGCAAGAATGGGCAAAAGCAGGTGTTTTTGAAGATTTGAGCAACAAAGA 

CTACCTGAAACGCGTGAAAAATGGCTACGCTGAAAAATATGCTGTAAACGAAAAAGTTTACAACGTTCC 

TTTTACAGCTAATGCTTATGGAATTTACTACAACAAAGATAAATTCGAAGAACTGGGCTTGAAGGTTCC 

TGAAACCTGGGATGAATTTGAACAGTTAGTCAAAGATATCGTTGCTAAAGGACAAACACCATTTGGAAT 

TGCAGGTGCAGATGCTTGGACACTCAATGGTTACAATCAATTAGCCTTTGCGACAGCAACAGGTGGAGG 

AAAAGAAGCAAATCAATACCTTCGTTATTCTCAACCAAATGCCATTAAATTGTCGGATCCGATTATGAA 

AGATGATATCAAGGTCATGGACATCCTTCGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTGCTGG 

CTATACCGATGTTATCGGAGCCTTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGCGAT 

CACAGCGATTAATGAACAAAAACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGG 

ACAAAGCTTAACCGTTGGTGCGGGAGACTTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAGA 

AGCCAATGCCTTTGTGGAATATATGACCCGTCCAGAAGTCATGCAAAAATACTACGATGTGGACGGATC 

TCCAACAGCGATCGAAGGGGTCAAACAAGCAGGAGAAGATTCACCGCTTGCTGGTATGACCGAATA.TGC 

CTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACTGGACCAGTGAAGCAGACTTCCATACCTTGAC 

CATGAACTATGTCTTGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGCCTTCTTTAACCCGAT 

GAAAGCGGATGTGGAT 

SP011 amino acid (SEQ ID NO: 16) 

SNYGKSADGTVTIEYFNQKKEMTKTLEEITRDFEKENPKIKVKWNVPNAGEVLKTRVLAGDVPDWNI 

YPQSIELQEWAKAGVFEDLSNKDYLKRVKNGYAEKYAVNEKVYNVPFTANAYGIYYNKDKFEELGLKVP 

ETWDEFEQLVKDIVAKGQTPFGIAGADAWTLNGYNQLAFATATGGGKEANQYLRYSQPNAIKLSDPIMK 

DDIKVMDILRINGSKQKNWEGAGYTDVIGAFARGDVLMTPNGSWAITAINEQKPNFKIGTFMIPGKEKG 

QSLTVGAGDLAWSISATTKHPKEANAFVEYMTRPEVMQKYYDVXX3SPTAIEGVKQAGEDSPLAGMTEYA 

FTDRHLVWLQQYWTSEADFHTLTMNYVLTGDKQGMVNDLNAFFNPMKADVD 

SP012 nucleotide (SEQ ID NO: 17) 

TGGGAAAAATTCTAGCGAAACTAGTGGAGATAATTGGTCAAAGTACCAGTCTAACAAGTCTATTACTAT 

TGGATTTGATAGTACTTTTGTTCCAATGGGATTTGCTCAGAAAGATGGTTCTTATGCAGGATTTGATAT 

TGATTTAGCTACAGCTGTTTTTGAAAAATACGGAATCACGGTAAATTGGCAACCGATTGATTGGGATTT 
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GAAAGAAGCTGAATTGACAAAAGGAACGATTGATCTGATTTGGAATGGCTATTCCGCTACAGACGAACG 
CCGTGAAAAGGTGGCTTTCAGTAACTCATATATGAAGAATGAGCAGGTATTGGTTACGAAGAAATCATC 
TGGTATCACGACTGCAAAGGATATGACTGGAAAGACATTAGGAGCTCAAGCTGGTTCATCTGGTTATGC 
GGACTTTGAAGCAAATCCAGAAATTTTGAAGAATATTGTCGCTAATAAGGAAGCGAATCAATACCAAAC 
CTTTAATGAAGCCTTGATTGATTTGAAAAACGATCGAATTGATGGTCTATTGATTGACCG i GTCTATGC 
AAACTATTATTTAGAAGCAGAAGGTGTTTTAAACGATTATAATGTCTTTACAGTTGGACTAGAAACAGA 
AGCTTTTGCGGTTGGAGCCCGTAAGGAAGATACAAACTTGGTTAAGAAGATAAATGAAGCTTTTTCTAG 
TCTTTACAAGGACGGCAAGTTCCAAGAAATCAGCCAAAAATGGTTTGGAGAAGATGTAGCAACCAAAGA 

AGTAAAAGAAGGACAG 


SP012 nucleotide (SEQ ID NO: 18) 

GKNSSETSGDNWSKYQSNKSITIGFDSTFVPMGFAQKDGSYAGFDIDLATAVFEKYGITVNWQPIDWDL 

KEAELTKGTIDLIWNGYSATDERREKVAFSNSYMKNEQVLVTKKSSGITTAKDMTGKTLGAQAGSSGYA 

DFEANPEILKNIVANKEANQYQTFNEALIDLKNDRIDGLLIDRVYANYYLEAEGVLNDYNVFTVGLETE 

AFAVGARKEDTNLVKKINEAFSSLYKDGKFQEISQKWFGEDVATKEVKEGQ 


SP013 nucleotide (SEQ ID NO: 19) 

TGCTAGCGGAAAAAAAGATACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGC 

TGATATTACTAAAAATATTGCTGGTGACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCC 

ACACGAATACGAACCACTTCCTGAAGACGTTAAGAAAACTTCTGAGGCTAATTTGATTTTCTATAACGG 

TATCAACCTTGAAACAGGTGGCAATGGaagpTTTACAAAATTGGTAGAAAATGCCAAGAAAACTGAAAA 

CAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTGAAGGTCAAAATGAAAAAGGAAA 

AGAAGACCCACACGCTTGGCTTAACCTTGAAAACGGTATTATTTTTGCTAAAAATATCGCCAAACAATT 

GAGCGCCAAAGACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAAGAATATACTGATAAGTTAGA 

CAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACTCATTGTAACCAG 

CGAAGGAGCATTCAAATACTTCTCTAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATAC 

TGAAGAAGAAGGAACTCCTGAACAAATCAAGACCTTGGTTGAAAAACTTCGCCAAACAAAAGTTCCATC 

ACTCTTTGTAGAATCAAGTGTGGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAAT 

CTACGCTCAAATCTTTACTGACTCTATCGCAGAACAAGGTAAAGAAGGCGACAGCTACTACAGCATGAT 

GAAATACAACCTTGACAAGATTGCTGAAGGATTGGCAAAA 

SP013 amino acid. (SEQ ID NO: 20) 

ASGKKDTTSGQKLKWATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYEPLPEDVKKTSEANLIFYNG 
INLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIr AKNIAKQL 
SAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKK.LIVTSEGAFKYFSKAYGVPSAYIWEINT 

EEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNI PIYAQIFTDSIAEQGKEGDSYYSMM 

KYNLDKIAEGLAK 

SP014 nucleotide (SEQ ID NO:21) 

TGGCTCAAAAAATACAGCTTCAAGTCCAGATTATAAGTTGGAAGGTGTAACATTCCCGCTTCAAGAAAA 

GAAAACATTGAAGTTTATGACAGCCAGTTCACCGTTATCTCCTAAAGACCCAAATGAAAAGTTAATTTT 

GCAACGTTTGGAGAAGGAAACTGGCGTTCATATTGACTGGACCAACTACCAATCCGACTTTGCAGAAAA 

ACGTAACTTGGATATTTCTAGTGGTGATTTACCAGATGCTATCCACAACGACGGAGCTTCAGATGTGGA 

CTTGATGAACTGGGCTAAAAAAGGTGTTATTATTCCAGTTGAAGATTTGATTGATAAATACATGCCAAA 

TCTTAAGAAAATTTTGGATGAGAAACCAGAGTACAAGGCCTTGATGACAGCACCTGATGGGCACATTTA 

CTCATTTCCATGGATTGAAGAGCTTGGAGATGGTAAAGAGTCTATTCACAGTGTCAACGATATGGCTTG 

GATTAACAAAGATTGGCTTAAGAAACTTGGTCTTGAAATGCCAAAAACTACTGATGATTTGATTAAA 

cctagaagctttcaaaaacggggatccaaAtggaaatggagaggctgatgaaattccattttcatttat 

tagJgctScggaaacgaagatttt^^^ 

tttagtagtaggaaatgatggcaaagttgacttcacagcagataacgataactataaagaaggtgtcaa 

ATTTATCCGTCAATTGCAAGAAAAAGGCCTGATTGATAAAGAAGCTTTCGAACATGATTGGAATAGTTA 

CATTGCTAAAGGTCATGATCAGAAATTTGGTGTTTACTTTACATGGGATAAGAATAATGTTACTGGAAG 

TAACGAAAGTTATGATGTTTTACCAGTACTTGCTGGACCAAGTGGTCAAAAACACGTAGCTCGTACAAA 

CGGTATGGGATTTGCACGTGACAAGATGGTTATTACCAGTGTAAACAAAAACCTAGAATTGACAGCTAA 

ATGGATTGATGCACAATACGCTCCACTCCAATCTGTGCAAAATAACTGGGGAACTTACGGAGATGACAA 

ACAACAAAACATCTTTGAATTGGATCAAGCGTCAAATAGTCTAAAACACTTACCACTAAACGGAACTGC 

acSgcagaacttcgtcaaaagactgaagtaggaggaccactagctatcctagattcatactatggtaa 

AGTAACAACCATGCCTGATGATGCCAAATGGCGTTTGGATCTTATCAAAGAATATTATGTTCCTTACAT 
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GAGCAATGTCAATAACTATCCAAGAGTCTTTATGACACAGGAAGATTTGGACAAGATTGCCCATATCGA 

AGCAGATATGAATGACTATATCTACCGTAAACGTGCTGAATGGATTGTAAATGGCAATATTGATACTGA 

GTGGGATGATTACAAGAAAGAACTTGAAAAATACGGACTTTCTGATTACCTCGCTATTAAACAAAAATA 

CTACGACCAATACCAAGCAAACAAAAAC 

SP014 amino acid (SEQ ID MO: 22) 

GSKNTASSPDYKLEGVTFPLQEKKTLKFMTASSPLSPKDPNEKLILQRLEKETGVHIDWTNYQSDFAEK 

RNLDISSGDLPDAIHNDGASDVDLMNWAKKGVIIPVEDLIDKYMPNLKKILDEKPEYKALMTAPDGHIY 

SFPWIEELGDGKESIHSVNDMAWINKDWLKKLGLEMPKTTDDLIKVLEAFKNGDPNGNGEADEIPFSFI 

SGNGNEDFKFLFAAFGIGDNDDHLWGNDGKVDFTADNDNYKEGVKFIRQLQEKGLIDKEAFEHDWNSY 

XAKGHDQKFGVYFTWDKNNVTGSNESYDVLPVLAGPSGQKHVARTNGMGFARDKMVITSVNKNLELTAK 

WIDAQYAPLQSVQNNWGTYGDDKQQNIFELDQASNSLKHLPLNGTAPAELRQKTEVGGPLAILDSYYGK 

VTTMPDDAKWRLDLIKEYYVPYMSNVNNYPRVFMTQEDLDKIAHIEADMNDYIYRKRAEWIVNGNIDTE 

WDDYKKELEKYGLSDYLAIKQKYYDQYQANKN 
SP015 nucleotide (SEQ ID NO:23) 

TAGTACAAACTCAAGCACTAGTCAGACAGAGACCAGTAGCTCTGCTCCAACAGAGGTAACCATTAAAAG 
TTCACTGGACGAGGTCAAACTTTCCAAAGTTCCTGAAAAGATTGTGACCTTTGACCTCGGCGCTGCGGA 
TACTATTCGCGCTTTAGGATTTGAAAAAAATATCGTCGGAATGCCTACAAAAACTGTTCCGACTTATCT 
. AAAAGACCTAGTGGGAACTGTCAAAAATGTTGGTTCTATGAAAGAACCTGATTTAGAAGCTATCGCCGC 
CCTTGAGCCTGATTTGATTATCGCTTCGCCACGTACACAAAAATTCGTAGACAAATTCAAAGAAATCGC 
CCCAACCGTTCTCTTCCAAGCAAGCAAGGACGACTACTGGACTTCTACCAAGGCTAATATCGAATCCTT 
AGCAAGTGCCTTCGGCGAAACTGGTACACAGAAAGCCAAGGAAGAATTGACCAAGCTAGACAAGAGCAT 
CCAAGAAGTCGCTACTAAAAATGAAAGCTCTGACAAAAAAGCCCTTGCGATCCTCCTTAATGAAGGAAA 
AATGGCAGCCTTTGGJGCCAAATCTCGTTTCTCTTTCTTGTACCAAACCTTGAAATTCAAACCAACTGA 
TACAAAATTTGAAGACTCACGCCACGGACAAGAAGTCAGCTTTGAAAGTGTCAAAGAAATCAACCCTGA 
CATCCTCTTTGTCATCAACCGTACCCTTGCCATCGGTGGGGACAACTCTAGCAACGACGGTGTCCTAGA 
AAATGCCCTTATCGCTGAAACACCTGCTGCTAAAAATGGTAAGATTATCCAACTAACACCAGACCTCTG 
GTATCTAAGCGGAGGCGGACTTGAATCAACAAAACTCATGATTGAAGACATACAAAAAGCTTTGAAA 

SP015 amino acid (SEQ ID NO: 24) 

STNSSTSQTETSSSAPTEVTIKSSLDEVKLSKVPEKIVTFDLGAADTIRALGFEKNIVGMPTKTVPTYL 

KDLVGTVKNVGSMKEPDLEAIAALEPDLIIASPRTQKFVDKFKEIAPTVLFQASKDDYWTSTKANIESL 

ASAFGETGTQKAKE ELTKLDKS I QEVATKNESSDKKALAI LLNEGKMAAFGAK3RFSF LYQTLKFKPTD 

TKFEDSRHGQEVSFESVKEINPDILFVINRTLAIGGDNSSNDGVLENALIAETPAAKNGKIIQLTPDLW 

YLSGGGLESTKLMIEDIQKALK 

SP016 nucleotide (SEQ ID NO:25) 

TGGCAATTCTGGCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTG 

GTGGGCATTCCCAGTATTTACCCAAGAAAAAACTGGTGACGGTGTTGGAACTTATGAAAAATCAATCAT 

CGAAGCGTTTGAAAAAGCAAACCCAGATATAAAAGTGAAATTGGAAACCATCGACTTCAAGTCAGGTCC 

TGAAAAAATCACAACAGCCATCGAAGCAGGAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTAT 

CATCCAATACGGTAAAAACGGTAAATTGGCTGAGTTGAATGACCTCTTCACAGATGAATTTGTTAAAGA 

TGTCAACAATGAAAACATCGTACAAGCAAGTAAAGCTGGAGACAAGGCTTATATGTATCCGATTAGTTC 

TGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCTGGAGTAGCAAACCTTGTAAAAGA 

AGGTTGGACAACTGATGATTTTGAAAAAGTATTGAAAGCACTTAAAGACAAGGGTTACACACCAGGTTC 

ATTGTTCAGTTCTGGTCAAGGGGGAGACCAAGGAACACGTGCCTTTATCTCTAACCTTTATAGCGGTTC 

TGTAACAGATGAAAAAGTTAGCAAATATACAACTGATGATCCTAAATTCGTCAAAGGTCTTGAAAAAGC 

AACTAGCTGGATTAAAGACAATTTGATCAATAATGGTTCACAATTTGACGGTGGGGCAGATATCCAAAA 

CTTTGCCAACGGTCAAACATCTTACACAATCCTTTGGGCACCAGCTCAAAATGGTATCCAAGCTAAACT 

TTTAGAAGCAAGTAAGGTAGAAGTGGTAGAAGTACCATTCCCATCAGACGAAGGTAAGCCAGCTCTTGA 

GTACCTTGTAAACGGGTTTGCAGTATTCAACAATAAAGACGACAAGAAAGTCGCTGCATCTAAGAAATT 

CATCCAGTTTATCGCAGATGACAAGGAGTGGGGACCTAAAGACGTAGTTCGTACAGGTGCTTTCCCAGT 

CCGTACTTCATTTGGAAAACTTTATGAAGACAAACGCATGGAAACAATCAGCGGCTGGACTCAATACTA 

CTCACCATACTACAACACTATTGATGGATTTGCTGAAATGAGAACACTTTGGTTCCCAATGTTGCAATC 

TGTATCAAATGGTGACGAAAAACCAGCAGATGCTTTGAAAGCCTTCACTGAAAAAGCGAACGAAACAAT 

CAAAAAAGCTATGAAACAA 
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SP016 am ino acid ( SSQ ID NO:26) 

GNSGGSKDAAKSGGDGAKTEITWWAFPVFTQEKTGDGVGTYEKSIIEAFEKANPDIKVKLETIDFKSGP 

ekittaieagtapdvlfdapgriiqygkngklaelndlftdefvkdvnnenivqaskagdkaymypiss 

apfymamnkkmledagvanlvkegwttddfekvlkalkdkgytpgslfssgqggdqgtrafisnlysgs 

vtdekvskyttddpkfvkglekatswikdnlinngsqfdggadiqnfangqtsytilwapaqngiqakl 

leaskvewevpfpsdegkpaleylvngfavfnnkddkkvaaskkfiqfiaddkewgpkdvvrtgafpv 

rtsfgklyedkrmetisgwtqyyspyyntidgfaemrtlwfpmlqsvsngdekpadalkaftekaneti 

KKAMKQ 

SP017 nucleotide {SEQ ID NO: 27) 

ttcacaagaaaaaacaaaaaatgaagatggagaaactaagacagaacagacagccaaagctgatggaac 

agtcggtagtaagtctcaaggagctgcccagaagaaagcagaagtggtcaataaaggtgattactacag 

cattcaagggaaatacgatgaaatcatcgtagccaacaaacactatccattgtctaaagactataatcc 

aggggaaaatccaacagccaaggcagagttggtcaaactcatcaaagcgatgcaagaggcaggtttccc 

tattagtgatcattacagtggttttagaagttatgaaactcagaccaagctctatcaagattatgtcaa 

ccaagatggaaaggcagcagctgaccgttactctgcccgtcctggctatagcgaacaccagacaggctt 

ggcctttgatgtgattgggactgatggtgatttggtgacagaagaaaaagcagcccaatggctcttgga 

^CATGCAGCTGATTATGGCTTTGTTGTCCGTTATCTCAAAGGCAAGGAAAAGGAAACAGGCTATATGGC 

tgaagaatggcacctgcgttatgtaggaaaagaagctaaagaaattgctgcaagtggtctcagtttgga 

agaatactatggctttgaaggcggagactacgtcgat 

SP017 amino acid (SEQ ID NO: 28) 

sqektknedgetkteqtakadgtvgsksqgaaqkkaewnkgdyysiqgkydeiivankhyplskdynp 

genptakaelvklikamqeagfpisdhysgfrsyetqtklyqdyvnqdgkaaadrysarpgysehqtgl 

afdvxgtdgdlvteekaaqwlldhaadygfwrylkgkeketgymaeewhlryvgkeakeiaasglsle 

EYYGFEGGDYVD 

SP019 nucleotide (SEQ XD NO:29) 

gaaaggtctgtggtcaaataatcttacctgcggttatgatgaaaaaataatcttggaaaatataaatat 

aaaaatacctgaagaaaaaatatcagttattattgggtcaaatggttgtgggaaatcaacactcattaa 

aaccttgtctcgacttataaagccattagagggagaagtattgcttgataataaatcaattaattctta 

taaagaaaaagatttagcaaaacacatagctatattacctcaatctccaataatccctgaatcaataac 

agtagctgatcttgtaagccgtggtcgtttcccctacagaaagccttttaagagtcttggaaaagatga 

ccttgaaataataaacagatcaatggttaaggccaatgttgaagatctagcaaataacctagttgaaga 

actttctgggggtcaaaggcaaagagtatggatagctctagccctagcccaagatacaagtatcctact 

"’TTAGATGAGCCAACTACTTACTTGGATATCTCATATCAAATAGAACTATTAGACCTCTTGACTGATCT 

^AACCAAAAATATAAGACAACCATTTGCATGATTTTGCACGATATAAATCTAACAGCAAGATACGCTGA 

TTACCTATTTGCAATTAAAGAAGGTAAACTTGTTGCAGAGGGAAAGCCTGAAGATATACTAAATGATAA 

ACTAGTTAAAGATATCTTTAATCTTGAAGCAAAAATTATACGTGACCCTATTTCCAATTCGCCTCTAAT 

GATTCCTATTGGCAAGCACCATGTTAACTCT 

SPO 19 amino acid (SEQ ID NO: 30) 

KGLWSNNLTCGYDEKIILENXNIKIPEEKISVIIGSNGCGKSTLIKTLSRLIKPLEGEVLLDNKSINSY 

kekdlakhiailpqspxipesitvadlvsrgrfpyrkpfkslgkddleiinrsmvkanvedlannlvee 

LSGGQRQRVWI ALALAQDTS ILLLDE PTTYLD I SYQ I ELLDLLTDIjNQKYKTTICMI LHDINLTARYAD 
YLFAIKEGKLVAEGKPEDILNDKLVKDIFNLEAKI IRDP I SNS PLMIPIGKHHVS 

5P02 0 nucleotide (SEQ ID NO: 31) 

AAACTCAGAAAAGAAAGCAGACAATGCAACAACTATCAAAATCGCAACTGTTAACCGTAGCGGTTCTGA 

AGAAAAACGTTGGGACAAAATCCAAGAATTGGTTAAAAAAGACGGAATTACCTTGGAATTTACAGAGTT 

CACAGACTACTCACAACCAAACAAAGCAACTGCTGATGGCGAAGTAGATTTGAACGCTTTCCAACACTA 

TAACTTCTTGAACAACTGGAACAAAGAAAACGGAAAAGACCTTGTAGCGATTGCAGATACTTACATCTC 

TCCAATCCGCCTTTACTCAGGTTTGAATGGAAGTGCCAACAAGTACACTAAAGTAGAAGACATCCCAGC 

AAACGGAGAAATCGCTGTACCGAATGACGCTACAAACGAAAGCCGTGCGCTTTATTTGCTTCAATCAGC 

TGGCTTGATTAAATTGGATGTTTCTGGAACTGCTCTTGCAACAGTTGCCAACATCAAAGAAAATCCAAA 

gSg^aa^actgaattggacgctagccaaacagctcgttcattgtcatcagttgacgctgccgt 

TGTAAACAATACCTTCGTTACAGAAGCAAAATTGGACTACAAGAAATCACT'XTTCAAAGAACAAGCTGA 

TGAAAACTCAAAACAATGGTACAACATCATTGTTGCAAAAAAAGATTGGGAAACATCACCTAAGGCTGA 
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TGCTATCAAGAAAGTAATCGCAGCTTACCACACAGATGACGTGAAAAAAGTTATCGAAGAATCATCAGA 

TGGTTTGGATCAACCAGTTTGG 

SP020 amino acid (SEQ ID NO: 32) 

NSEKKADNATTIKIATVNRSGSEEKRWDKIQELVKKDGITLEFTEFTDYSQPNKATADGEVDLNAFQHY 

NFLNNWNKENGKDLVAIADTYISPIRLYSGLNGSANKYTKVEDIPANGEIAVPNDATNESRALYLLQSA 

GLIKLDVSGTALATVANIKENPKNLKITELDASQTARSLSSVDAAWNNTFVTEAKLDYKKSLFKEQAD 

ENSKQWYNI I VAKKDWET S P KADAI KKVIAA YHTDDVKKVI EES SDGLDQ PVW 

SP021 nucleotide (SEQ ID NO:33) 

TTCGAAAGGGTCAGAAGGTGCAGACCTTATCAGCATGAAAGGGGATGTCATTACAGAACATCAATTTTA 

TGAGCAAGTGAAAAGCAACCCTTCAGCCCAACAAGTCTTGTTAAATATGACCATCCAAAAAGTTTTTGA 

AAAACAATATGGCTCAGAGCTTGATGATAAAGAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATA 

TGGCGAAAACTACCAACGTGTCTTGTCACAAGCAGGTATGACTCTTGAAACACGTAAAGCTCAAATTCG 

TACAAGTAAATTAGTTGAGTTGGCAGTTAAGAAGGTAGCAGAAGCTGAATTGACAGATGAAGCCTATAA 

GAAAGCCTTTGATGAGTACACTCCAGATGTAACGGCTCAAATCATCCGTCTTAATAATGAAGATAAGGC 

CAAAGAAGTTCTCGAAAAAGCCAAGGCAGAAGGTGCTGATTTTGCTCAATTAGCCAAAGATAATTCAAC 

TGATGAAAAAACAAAAGAAAATGGTGGAGAAATTACCTTTGATTCTGCTTCAACAGAAGTACCTGGAGC 

AAGTCCAAAAAAGCCGCTTTTCGCTTTTAGATGTGGGATGGTGTTTCTGGATGTGGATTACAGCAACTG 

GGGCACACCAAGCCTACAG 

SP021 amino acid (SEQ ID NO: 34) 

SKGSEGADLISMKGDVITEHQFYEQVKSNPSAQQVLLNMTIQKVFEKQYGSELDDKEVDDTIAEEKKQY 

GENYQRVLSQAGMTLETRKAQIRTSKLVELAVKKVAEAELTDEAYKKAFDEYTPDVTAQIIRLNNEDKA 

g^sl^^ EG ^ F ^ Q ^ KDNSTDEKTKENGGEITFDSAST ^ PGASPKKPLFAFRCGMVFLDVDYSNW 

SP022 nucleotide (SEQ ID NO:35) 

GGGGATGGCAGCTTTTAAAAATCCTAACAATCAATACAAAGCTATTACAATTGCTCAAACTCTAGGTGA 

TGATGCTTCTTCAGAGGAATTGGCTGGTAGATATGGTTCTGCTGTTCAGTGTACAGAAGTGACTGCCTC 

AAACCTTTCAACAGTTAAAACTAAAGCTACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTAC 

GTCTGATCAGTCTGGTTGGGTGGAATCTAATGGTAAATGGTATTTCTATGAGTCTGGTGATGTGAAGAC 

AGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGACTTAGGTGTCATGCAGACTGGATTTGT 

AAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATGTTTACAGGCTGGGGAACAGATGG 

TAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAAATGGCACTTG 

GTATTACCTTGACGAAGCAGGTATCATGAAGACAGGTTGGTTTAAAGTCGGACCACACTGGTACTATGC 

CTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGA 

ATGGGTAAAC 

SP022 amino acid (SEQ ID NO: 36) 

GMAAFKNPNNQYKAITIAQTLGDDASSEELAGRYGSAVQCTEVTASNLSTVKTKATWEKPLKDFRAST 
SDQSGWVE SNGKWYF YE SGDVKTGWVKTDGKWYYLNDLGVMQTG FVKF SG SWYYLSNSGAMFTGWGTDG 
SRWF YFDG SGAMKTGWYKENGTWYYLDEAG IMKTGWFKVG PHWYYAYG SGALAVSTTT PDGYRVNGNGE 

WVN 


SP023 nucleotide (SEQ XD NO: 37) 

AGACGAGC AAAAAATTAAGCAAGC AGAAGCGGAAGTTGAGAGTAAACAAGC TGAGGC TACAAGGTTAAA 
AAAAATCAAGACAGATCGTGAAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTAAAGAGCAAGG 
TAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAACACCTGATAAAAAAGAAAATGA 
TGCGAAGTCTTCAGATTCTAGCGTAGGTGAAGAAACTCTTCCAAGCCCATCCCTGAAACCAGAAAAAAA 
GGTAGCAGAAGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAAGAAGAAGATCG 
CCGTAACTACCCAACCAATACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAA 
AAAAGCGGAGCTTGAACTAGTAAAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAGC 
AAAAGCGGAAGTTGAGAGTAAAAAAGCTGAGGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAA 
AGCAGAAGAAGAAGCTAAACGAAAAGCAGCAGAAGAAGATAAAGTTAAAGAAAAACCAGCTGAACAACC 
ACAACCAGCGCCGGCTCCAAAAGCAGAAAAACCAGCTCCAGCTCCAAAACCAGAGAATCCAGCTGAACA 
ACCAAAAGCAGAAAAACCAGCTGATCAACAAGCTGAAGAAGACTATGCTCGTAGATCAGAAGAAGAATA 
TAATCGCTTGACTCAACAGCAACCGCCAAAAACTGAAAAACCAGCACAACCATCTACTCCAAAAACAGG 
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CTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATGGCGACAGGATGGCTCCA 

AAACAATGGCTCATGGTACTACCTCAACAGCAATGGCGCTATGGCGACAGGATGGCTCCAAAACAATGG 

TTCATGGTACTATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCATGGTA 

CTACCTAAACGCTAATGGTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAA 

CGCTAATGGTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGG 

TGATATGGCGACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAA 

AGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATGTCAATGGCTCAGGTGCCCTTGCAGTCAA 

CACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC 

SP023 amino acid (SEQ ID NO:38) 

DEQKIKQAEAEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGELATPDKKEND 

AKSSDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVX 

KAELELVKEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDKVKEKPAEQP 

QPAPAPKAEKPAPAPKPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTG 

WKQENGMWYFYNTDGSMATGWLQNNGSWYYLNSNGAMATGWLQNNGSWYYLNANGSMATGWLQNNGSWY 

YLNANGSMATGWLQYNGSWYYLNANGSMATGWLQYNGSWYYLNANGDMATGWVKDGDTWYYLEASGAMK 

ASQWFKVSDKWYYVNGSGALAVNTTVDGYGVNANGEWVN 

SP025 imcleotiide (SEQ ID NO: 39) 

CTGTGGTGAGGAAGAAACTAAAAAGACTCAAGCAGCACAACAGCCAAAACAACAAACGACTGTACAACA 

AATTGCTGTTGGAAAAGATGCTCCAGACTTCACATTGCAATCCATGGATGGCAAAGAAGTTAAGTTATC 

TGATTTTAAGGGTAAAAAGGTTTACTTGAAGTTTTGGGCTTCATGGTGTGGTCCATGCAAGAAAAGTAT 

GCCAGAGTTGATGGAACTAGCGGCGAAACCAGATCGTGATTTCGAAATTCTTACTGTCATTGCACCAGG 

AATTCAAGGTGAAAAAACTGTTGAGCAATTCCCACAATGGTTCCAGGAACAAGGATATAAGGATATCCC 

AGTTCTTTATGATAQCAAAGCAACCACTTCCAAGCTTATCAAATTCGAAGCATTCCTACAGAATATT 

SP025 amino acid (SEQ ID NO: 40) 

CGEEETKKTQAAQQPKQQTTVQQIAVGKDAPDFTLQSMDGKEVKLSDFKGKKVYLKFWASWCGPCKKSM 

PELMELAAKPDRDFEI LTVIAPG IQGEKTVEQFPQWFQEQGYKDI PVLYDTKATTSKLI KFEAr LQNI 

SP028 nucleotide (SEQ ID NO: 41) 

GACTTTTAACAATAAAACTATTGAAGAGTTGCACAATCTCCTTGTCTCTAAGGAAATTTCTGCAACAGA 

ATTGACCCAAGCAACACTTGAAAATATCAAGTCTCGTGAGGAAGCCCTCAATTCATTTGTCACCATCGC 

TGAGGAGCAAGCTCTTGTTCAAGCTAAAGCCATTGATGAAGC tGGAATTGATGCTGACAATGTCCTTTC 

AGGAATTCCACTTGCTGTTAAGGATAACATCTCTACAGACGGTATTCTCACAACTGCTGCCTCAAAAAT 

GCTCTACAACTATGAGCCAATCTTTGATGCGACagCTgTTGCCAATGCAAAAACCAAGGGCATGATTGT 

CGTTGGAAAGACCAACATGGACGAATTTGCTATGGGTGGTTCAGGtGAAACTTCACACTACGGAGCAAC 

TAAAAACGCTTGGAACCACAGCAAGGTTCCTGGTGGGTCATCAAGTGGTTCTGCCGCAGCTGTAGCCTC 

AGGACAAGTTCGCTTGTCACTTGGTTCTGATACTGGTGGTTCCATCCGCCAACCTGCTGCCTTCAACGG 

AATCGTTGGTCTCAAACCAACCTACGGAACAGTTTCACGTTTCGGTCTCATTGCCTTTGGTAGCTCATT 

AGACCAGATTGGACCTTTTGCTCCTACTGTTAAGGAAAATGCCCTCTTGCTCAACGCTATTGCCAGCGA 

AGATGCTAAAGACTCTACTTCTGCTCCTGTCCGCATCGCCGACTTTACTTCAAAAATCGGCCAAGACAT 

CAAGGGTATGAAAATCGCTTTGCCTAAGGAATACCTAGGCGAAGGAATTGATCCAGAGGTTAAGGAAAC 

AATCTTAAACGCGGCCAAACACTTTGAAAAATTGGGTGCTATCGTCGAAGAAGTCAGCCTTCCTCACTC 

TAAATACGGTGTTGCCGTTTATTACATCATCGCTTCATCAGAAGCTTCATCAAACTTGCAACGCTTCGA 

CGGTATCCGTTACGGCTATCGCGCAGAAGATGCAACCAACCTTGATGAAATCTATGTAAACAGCCGAAG 

CCAAGGTTTTGGTGAAGAGGTAAAACGTCGTATCATGCTGGGTACTTTCAGTCTTTCATCAGGTTACTA 

TGATGCCTACTACAAAAAGGCTGGTCAAGTCCGTACCCTCATCATTCAAGATTTCGAAAAAGTCTTCGC 

GGATTACGATTTGATTTTGGGTCCAACTGCTCCAAGTGTTGCCTATGACTTGGATTCTCTCAACCATGA 

CCCAGTTGCCATGTACTTAGCCGACCTATTGACCATACCTGTAAACTTGGCAGGACTGCCTGGAATTTC 

GATTCCTGCTGGATTCTCTCAAGGTCTACCTGTCGGACTCCAATTGATTGGTCCCAAGTACTCTGAGGA 

AACCATTTACCAAGCTGCTGCTGCTTTTGAAGCAACAACAGACTACCACAAACAACAACCCGTGATTTT 

TGGAGGTGACAAC 


SP028 am i no acid (SEQ ID NO: 42) 

TFNNKTIEELHNLLVSKEISATELTQATLENIKSREEALNSFVTIAEEQALVQAKAIDEAGIDADNVLS 

GIPLAVKDNISTDGILTTAASKMLYNYEPIFDATAVANAKTKGMIWGKTNMDEFAMGGSGETSHYGAT 

KNAWNHSKVPGGSSSGSAAAVASGQVRLSLGSDTGGSIRQPAAFNGIVGLKPTYGTVSRFGLIAFGSSL 
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DQIGPFAPTVKENALLLNAIASEDAKDSTSAPVRIADFTSKIGQDIKGMKIALPKEYLGEGIDPEVKET 

n™^ AKHFEKLGAIVEEVSLPHSKYGVAVYYIIASSEASSNL QRFDGIRYGYRAEDATNLDEIYVNSRS 

QGFGEEVKRRIMLGTFSLSSGYYDAYYKKAGQVRTLIIQDFEKVFADYDLILGPTAPSVAYDLDSLNHD 

^^ladlltipwlaglpgisipagfsqglpvglqligpkyseetiyqaaaIf^ShkSqp^f 

SP030 nucleotide (SEQ ID NO: 43) 

^rrt CAGGTAAACAACTACAAGTCGGCGACAAGGCGCTTGATTTTTCT CTTACTACAACAGATCTTTC 

I^ A ^^ gtctggctgattttgatggcaagaa aaaa g tct tgag t g tcgttccttctatcgatacagg 
GA ^ gc * caactcaaacacgtcgttttaatgaaga attggctggactggacaacacggtcgtattgac 
caatggacctaccttttgctcaaaaacgttggtgcggtgctgaaggccttgacaatgcc attat 

S^S^ CTACTTTGACCATTCTTTCGGGCGCGATTATGCCCTCTTGATCAACGA atggcacctatt 

t?SgagccSS^^^ 

SP030 amino acid (SEQ id NO: 44) 

v^n?Dpro^ LDFSLTTTDLSKKSLADFDGKKKVLSWPSIDTGICSTQTRRFNEELAG LDNTWLT 

SEPNFE ° AEGLDNAIMLSDYFDHSFGRDYALLINEWHLLARAVFVLDTDNTIRYVEYVDNIN 

SP031 nucleotide (SEQ ID NO:45) 

S5 AGGCTGATACAAGTATCGCAGACATTCAAAAAAGAGGCG aactggttgtcggtgtcaaacaagacgt 

^^^ ATTTTGGTTACAAnGATCCCAAGACCGGTACTTATTCTGGTATCGAAaCCGAG TTGGCCAAGAT 

Ara^a^ ATGAACTCAAGGTCAAGATTCGCTATGTGCCGGTTACAGCACAAACCCGCGGGGG CCTTCT 

r , GAGAATGAACAGGT - CGATATGGATATCGCGACCT ' I 'TACCATCACGGACGAACGCAAAAAACTCTACAA 
^JU A CAGTCCCTACTACACAGACGCTTC ' I ’ GGA ' rTTTTGGTGA ATAAATCTGCCAAAATCAAAAAGAT 
^iS^ CTAAACGGCAAAACCATCGGAGTCGCCCAAGGTTCTATCACCCAACGCCTG ATTACTGAACT 
r^^^ GAAAGGTCTGAAGTTTAAATTCGTCGAACTTGGTTCCTACCCAGAA TTG A TTACT' T ’CCCT 

GGA ATCGTATCGATACCTTTTCCGTTGACCGCTC tattctatctggctacactagtaaacggac 

CAACGACTATCTTGATAACTTGGTTACTAAATGGAGCAAGGATGGTAGTTTGCAGAAACT^TATGACCG 

ttacaagctcaaaccatctagccatactgcagat 

SP031 amino acid (SEQ id NO: 46) 

IADIQKRGELWGVKQDVPNFGYXDPKTGTYSGIETDLAKMVADELKV KIRYVPVTAQTRGPLL 

GKKKGLKFEWP^ ITDERKKLYNFTS?YYTDASGFLVNKSAKIKKIEDLNGKTIGVAQGSITQRLITEL 

HDYLDl.LVTKMSKao^ 

SP032 nucleotide (SEQ ID NO: 47 ) 

rI^r^ ATCATTTGAAAACAAAGAAACAAACCGTGGTGTCTTgACTTTCACTATCT CTCAAGACCAAAT 

CAAACCAGAATTGGACCGTGTCTTCAAGtCAGTGAAGAAATCTCTTAATGTTCCAGGTTTCCGTAAAGG 
^ G J GG II GCACGCCCTATCTTCGACCAAAAATTTGG tga ag aagctctttatcaagatgcaatgaacgc 
CTTTTGCCAAACGCTTATGAAGCAGCTGTAAAAGAAGCTGGTCTTGAAGTGGTTGCCCAACCAAAAAT 
a^att^^ TTCAATGGAAAAAGGTCAAGACTGGGTTATCACTGCTGAAGTCGTTACAA a ac ctg aa gt 
™JI^ r ACTACAAAAACCTTGAAGTATCAGTTGATGTAGAAAAAGAAGTAACrcAGGG TGATGT 
GG ^ GAGCGTATCGAACGCGAACGCAACAACCTGGCTGA attggttatcaaggaagctgctgctgaaaa 
aaa GGAGACTGTTGTGATCGACTTCGTTGGTTCTATCGACGGTG t tg a at tt g a c ggtggaaaaggtga 

agctggSaaaJ^ ACT ^ TTCAGGTCAATTCATCCCTGGTTTCGAAGACCAATTGGTAGGTCACTC 
aSSa^^^ GTTATCGTAACATTCCCAGAAGACTACCAAGCAGAAGACCTTGCAGG taa 
Trn a a a a GTGACAACTATCCACGAAGTAAAAGCTAAAGAAGTTCCGGCTCTTGACGATGAACT 
t ^J A ^ Gaga ^’ gatgaagaagttgaaacacttgg tgacttgaaagaaaaatacagcaaagaattggc 
TGCTGCTAAAGAAGAAGCTTACAAAGATGCAGTTGAAGGTGCAGCAATTGATACAGCTGTAGAAAATGC 

;“ GMOTraGAAG ^ rcATccArc ™ 

^^T G GTCAAGGGATCAACCCTGACATGTACTTCCAAATCACTGGAA ctactcaagaagacct 

TCACAACCAATACCAAGCAGAAGCTGAGTCACGTACTAAGACTAACCTTGTTATCGAAGCAGTTGCCAA 
AGCTGAAGGATTTGATGCTTCAGAAGAAGAAATCCAAAAAGAAGTTGAGCAATTGGCAGCAGACTACAA 
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CATGGAAGTTGCACAAGTTCAAAACTTGCTTTCAGCTGACATGTTGAAACATGATATCACTATCAAAAA 

AGCTGTTGAATTGATCACAAGCACAGCAACAGTAAAA 

SP032 amino acid (SEQ ID NO:48) 

SVSFENKETNRGVLTFTISQDQIKPELDRVFKSVKKSLNVPGFRKGHLPRPIFDQKFGEEALYQDAMNA 

LLPNAYEAAVKEAGLEWAQPKIDVTSMEKGQDWVITAEWTKPEVKLGDYKNLEVSVDVEKEVTDADV 

EERIERERNNLAELVIKEAAAENGDTWIDFVGSIDGVEFDGGKGENFSLGLGSGQFIPGFEDQLVGHS 

AGETVDVIVTFPEDYQAEDLAGKEAKFVTTIHEVKAKEVPALDDELAKDIDEEVETLADLKEKYSKELA 

aakeeaykdavegaaidtavenaeivelpeemiheevhrsvneflgnlqrqginpdmyfqitgttqedl 

hnqyqaeaesrtktnlvieavakaegfdaseeeiqkeveqlaadynkevaqvqnllsadmlkhditikk 

AVELITSTATVK 

SP033 nucleotide (SEQ ID NOi49) 

tggtcaaaaggaaagtcagacaggaaaggggatgaaaattgtgaccagtttttatcctatctacgctat 

ggttaaggaagtatctggtgacttgaatgatgttcggatgattcagtcaagtagtggtattcactcc.t 

tgaaccttcggcaaatgatatcgcagccatctatgatgcagatgtctttgtttaccattctcataca 

cgaatcttgggcaggaagtctggatccaaatctaaaaaaatccaaagtgaaggtcttagaggcttctga 

gggaatgaccttggaacgtgtccctggactagaggatgtggaagcaggggatggagttgatgaaaaaac 

SctSgaccctcacacatggctagatcctgaaaaagctggagaagaagcccaaattatcgctgataa 

actttcagaggtggatagtgagcataaagagacttatcaaaaaaatgcgcaacctttatcaaaaaagct 

CAGGAAT 

SP033 amino acid. (SEQ XD NO: 50) t 

GOKESQTGKGMKIVTSFYPIYAMVKEVSGDLNDVRMIQSSSGIHSFEPSANDIAAIYDADVFVYriSHTL 

ESWAGSLDPNLKKSKJ/KVLEASEGMTLERVPGLEDVEAGDGVDEKTLYDPHTWLDPEKAGEEAQIIAD 

LSEVDSEHKETYQKNAQPLSKKLRN 


epn 3 a nucleotide (SEQ XD NO: 51) 

GAAGGATAGATATATTTTAGCATTTGAGACATCCTGTGATGAGACCAGTGTCGCCGTCTTGAAAAACGA 

cS?gagct^ 

CGAAGTAGCCAGTCGTCACCATGTCGAGGTCATTACAGCCTGTATCGAGGAGGCATTGGCAGAAGCAG 

S^ccgLgaggacgtgacagctgttgcggttacctacggaccaggcttggtcggagccttgctagt 

tggtttgtcagc-gccaaggcctttgcttgggctcacggacttccactgattcctgttaatcacatggc 

^^S^^TCAGAGTOTGGACCCTTTCKAGTrrCCCTTaCTAOCCCKTTGGTCAGCSG 

cggacacacagagttggtttatgtttcggaggcaggagattataagattgttggggaaacccgtgatg 

tgcggttggtgaggcttatgataaggtcggccgtgtcatgggcttgacctatcctgcaggtcgtgaga 

tgacgagctggctcatcaggggcaggatatttatgatttcccccgtgccatgattaaggaagataatct 

ggagttctccttctcaggtttgaaatctgcctttatcaatcttcatcacaatgccgagcaaaagggaga 

aagcctgtctacagaagatttgtgtgcttccttccaagcagcagttatggacattctcatggcaaaaac 

tctcagagaacgcctagcagccgaaatcacagatgtcaaggttatcatcccccctctgcgactctgcgg 

cctcaatgccaaaccaagtcttgcctttgataccatggaa 


SP034 amino acid (SEQ ID NO: 52) 

KDRYILAFETSCDETSVAVLKNDDELLSNVIASQIESHKRFGGWPEVASRHHVEVITACIe,EALAEAG 

iteedvtavavtygpglvgallvglsaakafawahglplipvnhmaghlmaaqsveplefpllallvs 

ghtelvyvseagdykivgetrddavgeaydkvgrvmgltypagreidelahqgqdiydfpramikedn 

efsfsglksafinlhhnaeqkgeslstedlcasfqaavhdilmaktkkalekypvkilwaggvaankg 

lrerlaaextdvkviipplrlcgdnagmiayasvsxwnkenfagwdlnakpslafdtme 

^ nucleotide (SEQ ID NO: 53) 

SiGGTGTTGAAGTTACACGCATCAACGACCTTACAGATCCAGTTATGCTTGCACACTTGTTGAAATA 

ATTCATCAAAGT^TCTGCTGAACGTGATCCAGAACAAATCGACTGGGCTACTGACGGTGTAGAAAT 

TCTTGAAGCTACTGGTTTCTTTGCTAAGAAAGAAGCAGCTGAAAAACACCTTAAAGGTGGAGCTAAAAA 
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AGTTGTTATCACTGCTCCTGGTGGAAACGACGTTAAAACAGTTGTATTCAACACTAACCACGACGTTPT 

tgacggtactgaaacagttatctcaggtgcttcatgtactacaaactgcttggot^SJSJSgc 

tcttcaagacaactttggtgttgttgaaggattgatgactactatccacgcttacactggSS^^ 

gatccttgacggaccacaccgtggtggtgaccttcgccgtgctcgcgctggtgcScIScScg^c 

TAACTCAACTGGTGCTGCAAAAGCTATCGGTCTTGTAATCCCAGAATTGAATGGTAAACTTGACGGATC 

t^t^^ CGTTCCAACTCC ' V ' CTGGATCAGTTACTGAATTGGTAGCAGTTCTTGA aaagaacgttac 

GTTGATGAAGTGAACGCAGCTATGAAAGCAGCTTCAAACGAATCATACGGTTACACAGAAGATCCAAT 

CGTATCTTCAGATATCGTAGGTATGTCTTACGGTTCATTGTTTGACGCAACTCAAACTAAAGTTCTTGA 

^II° ACGGTAAACAATTGGTTAAAGTTGTATCATGGT acgacaacgaaatgtcatacactgcacaact 

TGTTCGTACTCTTGGAATACTTCGCAAAAATTGC i^GACAACT 

SP035 amino acid (SEQ ID NO: 54) 

WICT/GINGFGRIGRLAFRRIQNVEGVEVTRXNDLTDPVMLAHLLKYDTTQGRFDGTVEVKEGGFEVNGK 

‘ ^Yf, A ^ Rr ’ PEQIDWATDGVEIVLEATGFFAKKEAAEKHL KGGAKKWITAPGGNDVKTWFNTNHDVL 
. SCTTNCLAPMAKALQDNFGVVEGLMT TIHAYTGDQMI LDGPHRGGDLRRARAGAANIVP 

v ^^^ aiglvipelngkldgsaqrvptptgsvtelv avleknvtvdevnaamkaasnesygytedpi 

'/SSDIVGMSYGSLFDATQTKVLDVDGKQLVKWSWYDNEMSYTAQLVRTLGILRKNC 

SP03 6 nucleotide (SEQ ID NO:55) 

TTCTTACGAGTTGGGACTGTATCAAGCTAGAACGGTTAAGGAAAATAATCGTGTTTCCTATATAGATGC 

AAAACAAGCGACGCAAAAAACGGAGAATTTGACTCCTGATGAGGTTAGCAAGCGTGAAGGAATCAATGC 

raa^^ TCGTGATCAAGATAACAGACCAAGGCTATGTCACTTCACATGGCGACCACTA TCATTATTA 

AATGGTAAGGTTCCTTATGACGCTATCATCAGTGAAGAATTACTCATGAAAGATCCAAACTATAAGCT 

f^ GA ^ GAGGA : ATTGTTAATGAGGTCAAGGGTGGATATGT T A TCAAGGTAGATGGAAAATACTATGT 

_^ AGCTTAAGGATGC " rGCCCACGCGGATAACGTCCGTA CAAAAGAGGAAATCAATCGACAAAAACAAGA 

ar^J^& TCAACATCGTGAAGGTGGAACTCCAAGAAACGATGGTGCTGTTGCCTTGGCACGT TCGCAAGG 

AGGG ^ A ^ Agtacagatgatggttatatctttaatgcttg tgatatcatagaggatactggtgatgctta 

* ^ CGTTCCTCATGGAGATCATTACeATTACATTCCTAAGAA TGAGTTATCAGCTAGCGAGTTGGCTGC 
^aa??^o CTTCCTATCTGGTCGAGGAAATCTGTCAAATTCAAGAACCTATCGCCGACAAAA tagcga 
^ GAACAAACTGGGTACC ^ CTGTAAGCAATCCAGGAACTACAAATAC T aa CACAAGcS 
,^ GAGG CACTAACAGTCAAGCAAGTCAAAGTAATGAGA ttgatagtctcttgaaacagctctacaa 

AAPArr^rj^ G I CAACGACATGTAGAATCTGATCGCCTTGTCTTTGATCCAGCACAAATCACAAG TCG 

CAGCTAGAGGTGTTGCAGTGCCACACGGAGATCATTACCACTTCATCCCTTACTCTCAAATGTCTGA 

cccS^^ rcGCTCGT ^^ 

CAACCAAGTCCACAACCGACTC CGGAACC TAGTCCAGGCCCGCAACCTGCACCAAATCTTAA 

^^ GAGTCAAATTCTTCTTTGGTTAGTCAGCTGGTAGGAAAA gttggggaaggatatgtattcgaaga 
^ GG gTTATGTCTTTGCGAAAGATTTACCATCTGAAAG tgttaaaaatcttgaaa^L 

ap a^t't«t< AAACAAGAGAGTGTTTCACACACTTTAACTGCTAAAAAAGAAAATG t'1’ g ctcctcgtgacca 

agaattttatgataaagcatataatctgttaactgaggctcataaagccttgtttgnaaataagggtcg 

if^^I5 TGATTTCCAAGCCTTAGACAAATTATTAGAACGCTTGAATGA TGAATCGACTAATAAAGAAAA 

TpIaa^pap^ ATTTATTGGCATTCCTAGCACCAATTACCCATCCAGAGCGACTTGGCAAAGGAAA TTC 

^^Z2 A CTGAAGACGAAGTTGGTATTGCTCAATTAGCTGA taag tatacaacgtcagatgg 
;™ GA ^ atgaacatgatataatcagtgatgaaggaga t gcat a tgtaacg cctc a tatgggcca 
IaaIpp^ GGAAAAGATAGCCTTTCTGAT ^ ggaaaaagttgcagctcaag cctatactaaaga 

^^^apa TCCTACCTCCATCTCCAGACGCAGATGTTAAAGCAAATCCAACTGGAGATAGT GGAGCAGC 

fl „ T „^ A ‘^ TCGTGTGAAAGGGGAAAAACGAATTCCACT CGTTCGACTTCCATATATGGTTGAGCATAC 
^ AAAAACGGTAATTTCATTATOCCTCATAAGGATCA ^ accataatat t a aatttgcttg 
G Z!^ GAGATCACACATACAAAGCTCCAAATGGCTATA CC tTGGAAGATTTGTTTGCGACGATTAAGTA 

A GPP a A AG AACACCCTGACGAACGTCCACATTCTAATGATGGATGGGGC AATGGCAGTGACC AT GTGTT 

aS^T^p?apaa^ TGAAGATCCAAATAAGAACTTCAAAGCGGATGAAGAGCCAGTAGAGGAAAG 

TTTGCTTGPPA a^pt^^p^^ CTCAAGTAGAGACTGAAAAAGTAGAAGCCCAAGTCAAAGAAGGAGAAG '^ 

T a™^ GGAAAGTAACGGATTCTAGTCTGAAAGCCAATGCAACAGAAAC 'tct a gctggtttacgaaa 

^ TCTTCAAATTATGGATAACAATAGTATCAT ^ CAGAAGCAGAAAAA TT A CTTGCGTTGTO 

aaaaggaagtaatccttcatctgtaagtaaggaaaaaataaac 


SP036 amino acid (SEQ ID MO: 56) 

^ ELGE J GAP T!f <ENNRVSYIDGKGATGKTENL TPDEVSKREGINAEQIVIKITDQGYVTSHGDHYHYY 

NGKVPYDAIIStELLMKDPNYKLKDEDIVNEVKGGYVIKVDGKYYVYLKDAAHADNVRTKEEINRQKQE 
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HSQHREGGTPRNDGAVALARSQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAA 

AEAFLSGRGNLSNSRTYRRQNSDNTSRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYK 

LPLSQRHVESDGLVFDPAQITSRTARGVAVPHGDHYHFI PY SQMSELEERIARI I PLRYRSNHWVPDSR 

peqSpStpepspgpqpapnlkidsnsslvsqlvrkvgegyvfeekgisryvfakdlpsetvknlesk 

LSKQESVSHTLTAKKENVAPRDQEFYDKAYNLLTEAHKALFXNKGRNSDFQALDKLLERLNDESTNKEK 

lSdllIf^pithperlgkpnsqxeytedevriaqladkyttsdgyxfdehdiisdegdaywphmgh 

SHWIGKDSLSDKEKVAAQAYTKEKGILPPSPDADVKANPTGDSAAAIYNRVKGEKRIPLVRLPYMVEhT 
VEVKNGNLI I PHKDHYHNIKFAWFDDHTYKAPNGYTLEDLFATIKYYVEHPDERPHSNDGWGNASEHVL 

GKKDHSEDPNKNFKADEEPVEETPAEPEVPQVETEKVEAQLKEAEVLLAKVTDSSLKANATETLAGLRN 

NLTLQIMDNNSIMAEAEKLLALLKGSNPSSVSKEKIN 

SP038 nucleotide (SEQ I© NO: 57) 

TACTGAGATGCATCATAATCTAGGAGCTGAAAAGCGTTCAGCAGTGGCTACTACTATCGATAGTTTTAA 

GGAGCGAAGTCAAAAAGTCAGAGCACTATCTGATCCAAATGTGCGTTTTGTTCCCTTCTTTGGCTCTAG 

TGAATGGCTTCGTTTTGACGGTGCTCATTCTGCGGTATTAGCTGAGAAATACAATCGTTCCTACCGTCC 

TTATCTTTTAGGACAGGGGGGAGCTGCATCGCTTAACCAATATTTTGGAATGCAACAGATGTTACCACA 

GCTGGAGAATAAACAAGTTGTGTATGTTATCTCACCTCAGTGGTTCAGTAAAAATGGCTATGATC 

AGCCTTCCAGCAGTATTTTAATGGAGACCAGTTGACTAGTTTTCTGAAACATCAATCTGGGGATCAGGC 

TAGTC AATATGC AGCGACTCGCTTACTGCAAC AGTTCCCAAACGTAGCT ATGAAGGACCTGGTTC 

GTTGGCAAGTAAAGAAGAATTGTCGACAGCAGACAATGAAATGATTGAATTATTGGCTCGTT * 

ACGCCAAGCTTCCTTTTTTGGTCAGTTTTCGGTTAGAGGCTATGTTAACTACGATAAG^ATGTAGCTAA 

GTATTTAAAAATCTTGCCAGACCAGTTTTCTTATCAGGCAATAGAAGATGTTGTCAAAGCAGATGCTGA 

AAAAAATACTTCCAATAATGAGATGGGAATGGAAAATTATTTCTATAATGAGCAGATCAAGAAGGATTT 

GAAGAAATTAAAGGATTCTCAGAAAAGCTTTACCTATCTCAAGTCGCCAGAGTATAATGNNTTGCAGTT 

GGTTTTAACACAGTTTTCTAAATCTAAGGTAAACCCGATTTTTATCATTCCACCTGTTAATAAAAAATG 

Stcna™gctggtctacgagaggatatgtacc.aacaaacggtgcagaagattcgctaccagttaga 

Sg??aaggttttaccaatatagcagatttttctaaggacggcggggagcctttctttatgaaggacac 

cattcaccttggttggttgggttggttggcttttgacaaggcagttgatcctttcctatccaatcccac 

accagctccgacttaccatctgaatgagcgctttttcagcaaagattgggcgacttatgatggaga 

CAAAGAA 

eT>«;o amino acid. (SEQ ID NO: 58) 

TEMHHNLGAEKRSAVATTIDSFKERSQKVRALSDPNVRFVPFFGSSEWLRFDGAHSAVLAEKYNRSYRP 

YSGSSSSSR5MQQMLPQLENKQVVYVXSPQWFSKNGYDPA.AFQQYFNGDQLTSFLKHQSGDQA 

SQYAATRLLQQFPNVAMKDLVQKLASKEELSTADNEMXELLARFNERQASFFGQFSVRGYVNYDIGiyAK 

YLKILPDQFSYQAIEDWKADAEKNTSNNEMGMENYFYNEQIKKDLKKLKDSQKSFTYLKSP Q 

VLTQFSKSKVNPIFIIPPVNKKWMXYAGLREDMYQQTVQKIRYQLESQGFTNIADFSKDGGEPFFMKDT 

I HLGWLGWLAFDKAVD PF LSNPT PAPTYHLNERFF SKDWATYDGDVKE 


ggttttgag^ctatttgcagggggccctgattgagtcgattgagcaagtggaaaatgaccgtattgt 

ggggaaacacagtaatattctactggtcgataaaagcagtcataaaatcctcgaagttatcaaacacgt 

S^iS^TAGCTACCGCACCTTACTTCCAGGATCGACCTATATCGCTCCGCCAAGTACAAA 

a^StcStc^tactatcaaggatgaaaagctctttgaaatcctgcaaacccaagaactaacagc 

aaaaaatcttcaaagcctctttcaaggtctgggacgcgatacggcaaatgaattggaaaggatactggt 

tagtgaaaaactttccgctttccgaaattttttcaatcaagaaaccaagccatgcttgactgagacttc 

cttcagtccagttccttttgcaaatcaggtgggagagccttttgcaaatctttctgatttgttggacac 

ctISataaggataaggctgagcgcgaccgcgtcaaacagcaggccagtgaactgattcgtcgtgttga 

AAATGAACTTCAGAAAAACCGACACAAACTCAAAAAACAGGAAAAAGAGTTACTGGCGACAGACAA 

TGAAGAATTTCGTCAAAAAGGAGAATTGCTGACAACCTTCCTCCACCAAGTGCCTAACGACCAAGAC 

ggttatcctagacaactactataccaaccaacctatcatgattgcgcttgataaggctctgactcccaa 

ccagaatgcccaacgctattttaaacggtatcagaaactcaaagaagctgtcaaatacttgactgatt 

gaSg^Sgccactattctctatctggaaagtgtagaaaccgtcctcaaccaagctggact 

GGAAGAAATCGC^GAAATCCGTGAAGAATTGATTCAAACAGGTTTTATCCGCAGAAGACAACGGGAGAA 

AATCCAGAAACGCAAAAAACTAGAACAATATCTAGCAAGCGATGGCAAAACCATCATCTATGTCGGACG 

aaacaatcttcaaaatgaggaattgacctttaaaatggcccgcaaggaggaactttggttccatgctaa 

GGACATTCCTGGAAGCCATGTTGTCATCTCAGGAAATCTTGACCCATCTGATGCAGTCAAGACAGACGC 
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AGCAGAGTTAGCTGCCTACTTCTCTCAAGGGCGCCTGTCGAATCTGGTGCAGGTAGATATGATTGAAGT 

CAAAAAACTCAATAAACCAACTGGTGGAAAACCCGGCTTTGTCACTTACACAGGACAAAAGACCCTCCG 

CGTCACACCAGACTCCAAAAAAATTGCATCCATGAAAAAATCC 

SP039 amino acid (SEQ XD NO: 60) 

VLRKYLQGALIESIEQVENDRIVEITVSNKNEIGDHIQATLIIEIMGKHSNILLVDKSSHKILEVIKHV 

gfsqnsyrtllpgstyiappstkslnpftikdeklfeilqtqeltaknlqslfqglgrdtanelerilv 

SEKLSAFRNFFNQETKPCLTETSFSPVPFANQVGEPFANLSDLLDTYYKDKAERDRVKQQASELIRRVE 

NELQKNRHKLKKQEKELLATDNAEEFRQKGELLTTFLHQVPNDQDQVI LDNYYTNQP imialdkaltpn 
QNAQRYFKRYQKLKEAVKYLTDLIEETKATILYLESVETVLNQAGLEEIAEIREELIQTGFIRRRQREK 
IQKRKKLEQYLASDGKTI IYVGRNNLQNEELTFKMARKEELWFHAKDI PGSHWX SGNLDP SDAVKTDA 
AELAAYFSQGRLSNLVQVDMIEVKKLNKPTGGKPGFVTYTGQKTLRVTPDSKKIASMKKS 

SP040 nucleotide (SEQ ID NO: 61) 

GACAACATTTACTATCCATACAGTAGAGTCAGCACCAGCAGAAGTGAAAGAAATTCTTGAAACAGTAGA 

AAAAGACAACAATGGCTATATTCCCAACCTAATCGGTCTCTTGGCCAATGCCCCGACTGTTTTAGAAGC 

CTACCAAATTGTCTCATCTATCCACCGTCGCAACAGCCTGACACCCGTTGAGCGTGAAGTGGTGCAAAT 

CACGGCAGCCGTGACCAATGGTTGTGCCTTCTGTGTCGCAGGTCACACAGCCTTTTCCATCAAACAAAT 

ccagatgaatgatgacttgattcaagctcttcgcaatcgtactccaattgaaacagatcctaaattgga 

TACCCTAGCTAAGTTTACCTTGGCAGTTATCAATACCAAGGGTCGTGTAGGAGATGAAGCCTTGTCTGA 

GTTTTTAGAAGCTGGCTACACTCAACAAAATGCCTTGGATGTGGTTTTTGGTGTCAGCCTAGCAATCCT 

CTGTAACTATGCCAACAACTTAGCTAATACACCAATTAATCCAGAATTGCAACCTTATGCC 

SP040 amino acid (SEQ ID NO:62) 

TTFTIHTVESAPAEVKEILETVEKDNNGYIPNLIGLLANAPTVLEAYQIVSSIHRRNSLTPVEREVVQI 

TAAVTNGCAFCVAGHTAFSIKQIQMNDDLIQALRNRTPIETDPKLDTLAKFTLAVTNTKGRVGDEALSE 

FLEAGYTQQNALDWFGVSLAILCNYANNLANTPINPELQPYA 

SP041 nucleotide (SEQ ID NO:63) 

GGCTAAGGAAAGAGTGGATGTACTAGCTTATAAACAGGGGTTGTTTGAAACGAGAGAGCAGGCCAAGCG 

AGGTGTGATGGCTGGCCTAGTCGTAGCAGTCCTTAATGGAGAACGGTTTGACAAGCCAGGAGAGAAAAT 

TCCAGATGACACCGAATTAAAACTCAAGGGGGAGAAACTCAAGTATGTCAGCCGTGGTGGTTTGAAACT 

GGAAAAGGCCTTGCAGGTCTTTGATTTGTCGGTGGATGGCGCGACTACGATTGATATCGGGGCCTCTAC 

TGGAGGTTTTACCGATGTCATGCTACAGAATAGTGCCAAGTTGGTCTTTGCAGTCGATGTTGGTACCAA 

TCAGTTGGCTTGGAAATTACGCCAAGACCCACGAGTTGTCAGCATGGAGCAGTTCAATTTCCGCTATGC 

TGAAAAGACTGATTTCGAGCAGGAGCCGAGCTTTGCCAGTATTGATGTGAGTTTCATTTCCCTTAGTCT 

GATTTTGCCAGCCTTGCACCGTGTCTTGGCTGATCAAGGTCAGGTGGTAGCACTTGTCAAACCTCAGTT 

TGAGGCAGGACGTGAGCAGATTGGGAAAAATGGAATTATTCGAGATGCTAAGGTTCATCAGAATGTCCT 

TGAATCTGTAACAGCTATGGCAGTAGAGGTAGGTTTTTCAGTCCTTGGCTTGGACTTTTCTCCCATCCA 

AGGTGGACATGGAAATATTGAATTTTTAGCGTATTTGAAAAAAGAAAAGTCAGCAAGCAATCAGATTCT 

TGCTGAGATTAAAGAAGCAGTAGAGAGGGCGCATAGTCAATTTAAAAATGAA 

SP041 amino acid (SEQ XD NO: 64) 

AKERVDVLAYKQGLFETREQAKRGVMAGLWAVLNGERFDKPGEKIPDDTELKLKGEKLKYVSRGGLKL 

EKALQVFDLSVDGATTIDIGASTGGFTDVMLQNSAKLVFAVDVGTNQLAWKLRQDPRWSMEQFNFRYA 

EKTDFEQEPSFASIDVSFISLSLILPALHRVLADQGQWALVKPQFEAGREQIGKNGIIRDAKVHQNVL 

ESVTAMAVEVGFSVLGLDFSPIQGGHGNIEFLAYLKKEKSASNQILAEIKEAVERAHSQFKNE 

SP042 nucleotide (SEQ ID NO: 65) 

TTGTTCCTATGAACTTGGTCGTCACCAAGCTGGTCAGGTTAAGAAAGAGTCTAATCGAGTTTCTTATAT 

AGATGGTGATCAGGCTGGTCAAAAGGCAGAAAACTTGACACCAGATGAAGTCAGTAAGAGGGAGGGGAT 

CAACGCCGAACAAATNGTNATCAAGATTACGGATCAAGGTTATGTGACCTCTCATGGAGACCATTATCA 

TTACTATAATGGCAAGGTTCCTTATGATGCCATCATCAGTGAAGAGCTCCTCATGAAAGATCCGAATTA 

TCAGTTGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTATGTCATTAAGGTAAACGGTAAATA 

CTATGTNTACCTTAAGGATGCAGCTCATGCGGATAATATTCGGACAAAAGAAGAGATTAAACGTCAGAA 

GCAGGAACGCAGTCATAATCATAACTCAAGAGCAGATAATGCTGTTGCTGCAGCCAGAGCCCAAGGACG 

TTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATAT 

CGTTCCTCACGGCGACCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGC 
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AGAAGCCTATTGGAATGGGAAGCAGGGATCTCGTCCTTCTTCAAGTTCTAGTTATAATGCAAATCCAGC 

TCAACCAAGATTGTCAGAGAACCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAA 

CATTTCAAGCCTTTTACGTGAATTGTATGCTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCT 

TATTTTCGACCCAGCGCAAATCACAAGTCGAACCGCCAGAGGTGTAGCTGTCCCTCATGGTAACCATTA 

CCACTTTATCCCTTATGAACAAATGTCTGAATTGGAAAAACGAATTGCTCGTATTATTCCCCTTCGTTA 

TCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCAAGTCCACAATCGACTCCGGAACCTAG 

^ & a rrTcr a rr AAA TrrTC AACC AGCTCC AAGCAATCC AATTGATGAGAAATTGGTC AA 


ACTAGCAAGAATTCACCAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAA 

CCTGTTGGAACGACTCAAGGATGTCNCAAGTGATAAAGTCAAGTTAGTGGANGATATTCTTGCCTTCTT 

AGCTCCGATTCGTCATCCAGAACGTTTAGGAAAACCAAATGCGCAAATTACCTACACTGATGATGAGAT 

TCAAGTAGCCAAGTTGGCAGGCAAGTACACAACAGAAGACGGTTATATCTTTGATCCTCGTGATATAAC 

CAGTGATGAGGGGGATGCCTATGTAACTCCACATATGACCCATAGCCACTGGATTAAAAAAGATAGTTT 

GTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCTAAAGAGAAAGGTTTGACCCCTCCTTCGACAGA 

CCATCAGGATTCAGGAAATACTGAGGCAAAAGGAGCAGAAGCTATCTACAACCGCGTGAAAGCAGCTAA 

GAAGGTGCCACTTGATCGTATGCCTTACAATCTTCAATATACTGTAGAAGTCAAAAACGGTAGTTTAAT 

CATACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACC 

TAAGGGGTATACTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCC 

GCATTCAGATAATGGTTTTGGTAACGCTAGCGACCATGTTCAAAGAAACAAAAATGGTCAAGCTGATAC 

CAATCAAACGGAAAAACCAAGCGAGGAGAAACCTCAGACAGAAAAACCTGAGGAAGAAACCCCTCGAGA 

a a A nf^AAC C AG AAG AAT C ACC AGAGG A 



AAAAATCCAGGAT 

SP042 amino acid (SEQ ID NO: 66) 


YYNGKVPYDAIISEELLMKDPNYQLKDSDIWEIKGGYVIKVNGKYYVY^ua^^uinx^.^^^v- 

QERSHNHNSRADNAVAAARAQGRYTTDDGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNEL5ASELAAA 

EAYWNGKQGSRPSSSSSYNANPAQPRESENHNLTVTPTYHQNQGENISSLLRELYAKPLSc.RHVES 

IFDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPS 

000 /- 1 D& oMDnDa dcmptdpkt.VKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSKLAKQESLSH 


SP043 nucleotide (SEQ ID NO:67) a * a a 

TTATAAGGGTGAATTAGAAAAAGGATACCAATTTGATGGTTGGGAAATTTCTGGTTTCGAAGGTAAAAA 

agacgctggctatgttattaatctatcaaaagatacctttataaaacctgtattcaagaaaatagagga 

gaaaaaggaggaagaaaataaacctacttttgatgtatcgaaaaagaaagataacccacaagtaaacca 

tagtcaattaaatgaaagtcacagaaaagaggatttacaaagagaagagcattcacaaaaatctgattc 

aactaaggatgttacagctacagttcttgataaaaacaatatcagtagtaaatcaactactaacaatcc 

TAATAAG 

SP043 amino acid (SEQ ID NO: 68) 

YKGELEKGYQFDGWEISGFEGKKDAGYVINLSKDTFIKPVFKKXEEKKEEENKPTFDVSKKKDNPQVNH 

SQLNESHRKEDLQREEHSQKSDSTKDVTATVLDKNNISSKSTTNNPNK 


tgcgattattcttgaaagcaatggacattttgccatggtggatacaggagaagattatgatttcccaga 

tggaagtgattctcgctatccatggagagaaggaattgaaacgtcttataagcatgttctaacagaccg 

tgtctttcgtcgtttgaaggaattgggtgtccaaaaacttgattttattttggtgacccatacc 

tgatcatattggaaatgttgatgaattactgtctacctatccagttgaccgagtctatcttaagaaata 
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TAGTGATAGTCGTATTACTAATTCTGAACGTCTATGGGATAATCTGTATGGCTATGATAAGGTTTTACA 

GACTGCTGCAGAAAAAGGTGTTTCAGTTATTCAAAATATCACACAAGGGGATGCTCATTTTCAGTTTGG 

GGACATGGATATTCAGCTCTATAATTATGAAAATGAAACTGATTCATCGGGTGAATTAAAGAAAATTTG 

GGATGACAATTCCAATTCCTTGATTAGCGTGGTGAAAGTCAATGGCAAGAAAATTTACCTTGGGGGCGA 

TTTAGATAATGTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATGAAGTT 

TAATCATCACCATGATACCAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAGTTTGAT 

TGTTCAAACTTCGGATAGTCTACCTTGGAAAAATGGTGTTGATAGTGAGTATGTTAATTGGCTCAAAGA 

ACGAGGAATTGAGAGAATCAACGCAGCCAGCAAAGACTATGATGCAACAGTTTTTGATATTCGAAAAGA 

CGGTTTTGTCAATATTTCAACATCCTACAAGCCGATTCCAAGTTTTCAAGCTGGTTGGCATAAGAGTGC 

ATATGGGAACTGGTGGTATCAAGCGCCTGATTCTACAGGAGAGTATGCTGTCGGTTGGAATGAAATCGA 

AGGTGAATGGTATTACTTTAACCAAACGGGTATCTTGTTACAGAATCAATGGAAAAAATGGAACAATCA 

TTGGTTCTATTTGACAGACTCTGGTGCTTCTGCTAAAAATTGGAAGAAAATCGCTGGAATCTGGTATTA 

TTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATTCAAGATAAAGAGCAGTGGTATTATTTGGATGT 

TGATGGTTCTATGAAGACAGGATGGCTTCAATATATGGGGCAATGGTATTACTTTGCTCCATCAGGGGA 

A 


SP044 amino acid (SEQ ID NO:70) 

NVQAQESSGNKIHFINVQEGGSDAIILESNGHFAMVDTGEDYDFPEX3SDSRYPWREGIETSYKHVLTDR 
VFRRLKELGVQKLDFILVTHTHSDHIGNVDELLSTYPVDRVYLKKYSDSRITNSERLWDNLYGYDKVLQ 
TAAEKGVSVIQNITQGDAHFQFGDMDIQLYNYENETDSSGELKKIWDDNSNSLISWKVNGKKIYLGGD 
LDNVHGAEDKYGPLIGKVDLMKr NHHHDTNKSNTKDFIKNLSPSL2VQTSDSLPWKNGVDSEYVNWLKE 
RGIERINAASKDYDATVFDIRKDGFVNISTSYKPIPSFQAGWHKSAYGNWWYQAPDSTGEYAVGWNEIE 

GEWYYFNQTG I LLQNQWKKWNNHWF YLTDSGA SAKNWKK I AG I WYYFNKENQME IGWI ODKEOWY v r,n\/ 
DGSMKTGWLQYMGQWYYFAPSGE * 

/ 

SP045 nucleotide (SEQ ID NO:71) 

CTTGGGTGTAACCCATATCCAGCTCCTTCCAGTCTTGTCTTACTACTTTGTCAATGAATTGAAAAACCA 

TGAACGCTTGTCTGACTACGCTTCAAGCAACAGCAACTACAACTGGGGATATGACCCTCAAAACTACTT 

CTCCTTGACTGGTATGTACTCAAGCGATCCTAAGAATCCAGAAAAACGAATCGCAGAATTTAAAAACCT 

CATCAACGAAATCCACAAACGTGGTATGGGAGCTATCCTAGATGTCGTTTATAACCACACAGCCAAAGT 

CGATCTCTTTGAAGATTTGGAACCAAACTACTACCACTTTATGGATGCCGATGGCACACCTCGAACTAG 

CTTTGGTGGTGGACGCTTGGGGACAACCCACCATATGACCAAACGGCTCCTAATTGACTCTATCAAATA 

CCTAGTTGATACCTACAAAGTGGATGGCTTCCGTTTCGATATGATGGGAGACCATGACGCCGCTTCTAT 

CGAAGAAGCTTACAAGGCTGCACGCGCCCTCAATCCAAACCTCATCATGCTTGGTGAAGGTTGGAGAAC 

CTATGCCGGTGATGAAAACATGCCTACTAAAGCTGCTGACCAAGATTGGATGAAACATACCGATACTGT 

CGCTGTCTTTTCAGATGACATCCGTAACAACCTCAAATCTGGTTATCCAAACGAAGGTCAACCTGCCTT 

TATCACAGGTGGCAAGCGTGATGTCAACACCATCTTTAAAAATCTCATTGCTCAACCAACTAACTTTGA 

AGCTGACAGCCCTGGAGATGTCATCCAATACATCGCAGCCCATGATAACTTGACCCTCTTTGACATCAT 

TGCCCAGTCTATCAAAAAAGACCCAAGCAAGGCTGAGAACTATGCTGAAATCCACCGTCGTTTACGACT 

TGGAAATCTCATGGTCTTGACAGCTCAAGGAACTCCATTTATCCACTCCGGTCAGGAATATGGACGTAC 

TAAACAATTCCGTGACCCAGCCTACAAGACTCCAGTAGCAGAGGATAAGGTTCCAAACAAATCTCACTT 

GTTGCGTGATAAGGACGGCAACCCATTTGACTATCCTTACTTCATCCATGACTCTTACGATTCTAGTGA 

TGCAGTCAACAAGTTTGACTGGACTAAGGCTACAGATGGTAAAGCTTATCCTGAAAATGTCAAGAGCCG 

TGACTATATGAAAGGTTTGATTGCCCTTCGTCAATCTACAGATGCCTTCCGACTTAAGAGTCTTCAAGA 

TATCAAAGACCGTGTCCACCTCATCACTGTCCCAGGCCAAAATGGTGTGGAAAAAGAGGATGTAGTGAT 

TGGCTACCAAATCACTGCTCCAAACGGCGATATCTACGCAGTCTTTGTCAATGCGGATGAAAAAGCTCG 

CGAATTTAATTTGGGAACTGCCTTTGCACATCTAAGAAATGCGGAAGTTTTGGCAGATGAAAACCAAGC 

AGGACCAGTCGGAATTGCCAACCCGAAAGGACTTGAATGGACTGAAAAAGGCTTGAAATTGAATGCCCT 

TACAGCTACTGTTCTTCGAGTCTCTCAAAATGGAACTAGCCATGAGTCAACTGCAGAAGAGAAACCAGA 

CTCAACCCCTTCCAAGCCTGAACATCAAAATGAAGCTTCTCACCCTGCACATCAAGACCCAGCTCCAGA 

AGCTAGACCTGATTCTACTAAACCAGATGCCAAAGTAGCTGATGCGGAAAATAAACCTAGCCAAGCTAC 

AGCTGATTCACAAGCTGAACAACCAGCACAAGAAGCACAAGCATCATCTGTAAAAGAAGCGGTTCGAAA 

CGAATCGGTAGAAAACTCTAGCAAGGAAAATATACCTGCAACCCCAGATAAACAAGCTGAA 

SP045 nucleotide (SEQ ID NO:72) 

LGVTHIQLLPVLSYYFVNELKNHERLSDYASSNSNYNWGYDPQNYFSLTGMYSSDPKNPEKRIAEFKNL 

INEIHKRGMGAILDWYNHTAKVDLFEDLEPNYYHFMDADGTPRTSFGGGRLGTTHHMTKRLLIDSIKY 

LVDTYKVDGFRFDMMGDHDAASIEEAYKAARALNPNLIMLGEGWRTYAGDENMPTKAADQDWMKHTDTV 
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AVFSDDIRNNLKSGYPNEGQPAFITGGKRDVNTIFKNLIAQPTNFEADSPGDVIQYIAAHDNLTLFDII 

AQSIKKDPSKAENYAEIHRRLRLGNLMVLTAQGTPFIHSGQEYGRTKQFRDPAYKTPVAEDKVPNKSHL 

LRDKDGNPFDYPYFIHDSYDSSDAVNKFDWTKATDGKAYPENVKSRDYMKGLIALRQSTDAFRLKSLQD 

IKDRVHLITVPGQNGVEKEDWIGYQITAPNGDIYAVFVNADEKAREFNLGTAFAHLRNAEVLADENQA 

GPVGIANPKGLEWTEKGLKLNALTATVLRVSQNGTSHESTAEEKPDSTPSKPEHQNEASHPAHQDPAPE 

ARPDSTKPDAKVADAENKPSQATADSQAEQPAQEAQASSVKEAVRNESVENSSKENIPATPDKQAE 

SP046 nucleotide (SEQ ID NO:73) 

TAGTGATGGTACTTGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGT 

TTTNG AT AC TC ATT ATC AATCTTGGTTC T AT AT AAAAGC AGATG C T AAC T ATGC TG AAAATG AATGGCT 

AAAGCAAGGTGACGACTATTTTTACCTCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGA 

CAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACTTCCTA 

TGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGATTCTCAATACGATGCTTGGTTTTA 

TATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAATTAAAGGGAAGGACTATTATTTCAA 

ATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATGTGAATGCTAGTGGTGCCAAAGT 

ACAGCAAGGTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTACATCAAAGAAAATGGAAACTATGC 

TGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCTACATGGCAGCCAA 

TGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGATGGGAAAATGGCTGAAAAAGAATG 

GGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATG 

GATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTA 

CGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGA 

TGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAATGAAAATGCTGCTTACTA 

TCAAGTAGTGCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCTTTCCTATATATCGCAAGG 

TAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGCGCTTGGCTATTACTATTTCTGGTTT 

GTCAGGCTATATGAAAACAGAAGATTTACAAGCGCTAGATGCTAGTAAGGACTTTATCCCTTATTATGA 

GAGTGATGGCCACCGTTTTTATCACTATGTGGCTCAGAATGCTAGTATCCCAGTAGCTTCTCATCTTTC 

TGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATTTTGATGGTTTTAAGCTTGAGAA 

TCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAAGAATTGGATAAGGTATTTAG 

TTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGCCGAAGAACATTA 

CCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAATTGC 

CAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATT 

TGATGATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAG 

AACTTTCCTTGGAAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGACCCTTATTGGGGCGAAAA 

AATTGCTAGTGTGATGATGAAAATCAATGAGAAGCTAGGTGGCAAAGAT 

SP046 amino acid (SEQ XD NO: 74) 

SDGTWQGKQYLKEDGSQAANEWVXDTHYQSWFYIKADANYAENEWLKQGDDYFYLKSGGYMAKSEWVED 
KGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEWLQIKGKDYYFK 
SGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGYMAAN 
EWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVY 
DSHSQAWYYFKSGGYMAKNETVDGYQLGSDGKWLGGKTTNENAAYYQWPVTANVYDSDGEKL S Y I SQG 
SWWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLS 
DMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFS LLN INNS LLENKGATFKEAEEHY 
HINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWIKENYIDRGR 
TFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKD 

SP048 nucleotide (SEQ ID NO: 75) 

tgggattcaataTgtcagagatgatactagagataaagaagagggaatagagtatgatgacgctgacaa 

TGGGGATATTATTGTAAAAGTAGCGACTAAAC CTAAGGT AGT AAC C AAGAAAATTTC AAGTACGCGAAT 

tcgttatgaaaaagatgaaacaaaagaccgtagtgaaaatcctgttacaattgatggagaggatggcta 

tgtaactacgacaaggacctacgatgttaatccagagactggttatgttaccgaacaggttactgttga 

tagaaaagaagccacggatacagttatcaaagttccagctaaaagcaaggttgaagaagttcttgttcc 

atttgctactaaatatgaagcagacaatgacctttctgcaggacaggagcaagagattactctaggaaa 

gaatgggaaaacagttacaacgataacttataatgtagatggaaagagtggacaagtaactgagagtac 

TTTAAGTC AAAAAAAAGACTC t C AAACAAGAGTTGTT a AAAAAAGaACC Ar kCC C CAAGTTCTTGTCCA 
AGAAATTCCAATCGAAACAGAATATCTCGATGGCCCaACTCTTGATAAAaGTCAAGAAGTAGAAGAAGT 
AGGAGAAATTGGTAAATTACTCTTACTACAATCTATACTGGTAGATGAACGTGATGGAACAATTGAAGA 
AACTACTTCTCGTCAAATTACTAAAGAGATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGA 
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^ lrt ^ Ul - AUC - TAAAGC AGTTGCTCCAACAACAGACTGGAAACAAGAAAATGGTATGTG 

^ ^ttataa^gtgatcgttgcatggcaacaggttgggtacaagttaatagttcatggtactacct 
.^^^ AACGGTTCTATGAAAGTCAATCAATGGT tccaagttggtggtaaatggtattatgtaaatac 

atcgggtgagttagcggtcaatacaagtatagatggctatagagtcaatgataatggtgaatgggtgcg 



SP049 nucleotide ( SEQ XD NO:77) 

GGATAATAGAGAAGCATTAAAAACCTTTATGACGGGTGAAAATTTTTATCTCCAACATTATCTAGGACr 

ACATAGGGAAGAACTAAATGGAGAGCATGGCTATACCTTCCGTGTTTGGGrarr''raa-rr!r-Tv^»^^^ m «„ 


11 ^uaactggattgaaaatcagattccaatggtaagaaatgattttggggt 
GGGAAGTCTTTACCAATATGGCTCAAGAAGGGCATATTTACAAATATCATGTCACACGTCAAAATGG 

I^Ir^rS^ GAAGATTGACCCTTTTGCTGTCAGGTATGAGGCTCGTCCAGGAACAGGGG CAATCGT 

ir , f , GAGG ^ TCCTGAGAAGAAATGGAAGGATGGACTTTG GCTGGCACGAAGAAAACGTTGGGGCTTTGA 

J GAGGG I^ tcaatatttatgaagttcacgctg gatcatggaaaagaaattctgatggcagtcctta 

^^^J TTGCCCAGCTCAAGGATGAACTCATTCCTTATCTCG ' rTGAAATGAACTATA CTCATATTGAGT r T’ 

„ A ^ GGTTGATGTCCCATCCT ' rTGGGCTTGAG ' rTGGGG GT ATCAGCTTA T GGGT T A CTTCGCTTTAGA 

GCATGCTTATGGCCGACCAGAGGAGTTTCAAGATTTTGTC 


^ x I 1 ^ i\^tutaggattTGACTACAAATGGAA 

GA 3! GG 2 ggatgaatgatatcctccgtttctacgaa g a agatgcgatctatcgtaaatatgactttaa 

SI GG I G J G JTI GAGCTTTATGTATGTTTNCAAGGAGAATTATCTCTTGCCAT t c t cgca cgatgaagt 

AAGAAGAGTATGATGCATAAGATGTGGG g agatcgttacaa tcaattcgcaggcttgcg 

ATACGTACCAAATTTCTCACCCTGGTAAGAAATTGCTCTTCATGGGTA gcgaatacggtca 

^ri^ TAGAATGGAAATCTGAAGAACAGTTGGAATGGTCTAACCTAGAAGACCCAAT G AA TGCTAAGAT 

GAAGTATTTCGCTTCTCAGCTAAACCAGTTTTACAAAGATCATCGCTGTCTGTGGGAAATTGATACCAn 



taaaaaggga 

SP050 amino acid (SEQ XD NO: 80) 
r^^ T ^ IGVIVDWVPXHFTINDDALAYYDGTPTFEYQDHNKAHNHGWGALNFD LGKNEVQSFLIS 

K H W I DVYH LDG I RVD A vs NMLYL D Y DD APW Tp N KDGG NLN Y E GY YF L QRLNEVIKLEYPDVMMIAEE 
\tu^vvo^3^^ KE ^ GGLGFD ^^^^ gw ^^ d ^ IjRFY ^' edp ^^^^^ df ^ Ij ^ ,f ^^^^^^enyllpfshdev 
^^^T HKMWGDRYNQFAGLRNLYTYQICHPGKKLLFMGSEY gQFlewkseeqlewsnledpmnakm 
KYFASQLNQF YKDHRCLWE I DTS YDGIEI I DADNRDQSVLSF I RKGKKG 

SP051 nucleotide (SEQ ID NO: 81) 


_9818930A2_L> 
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atctgtagtttatgcggatgaaacacttattactcatactgctgagaaacctaaagaggaaaaaatgat 

r-j,, A aoTCTGATAAAGCTTTGGAAACTAAAAATATAGTTGAAAGGACAGAACAAAGTGAACC 

^^^ScSS^tSgagnagaaagaagatgaagccgtaactccaaaagaggaaaaagt 

ggaagatgctaaagctgtaacaaatgaagaagtgaatcaaatgattgaagacaggaaagtggattttaa 

'rrAAAATTGGTACTTTAAACTCAATGCAAATTCTAAGGAAGCCATTAAACCTGATGCAGACGTATCTAC 

GTGGAAAAAATTAGATTTACCGTA'TCACTGGAGTATCTTTAACGATTTCGATCA'rcAA'rcTCCTGCACA 

AAATGAAGGTGGACAGCTCAACGGTGGGGAAGCTTGGTATCGCAAGACTTTCAAACTAGAaGAAAAAGA 

A pwnvpr. a GTCTAC ATGGATTCTC AAGTTTATGTC AATGGTC A 


TGGTATCTATCGTGATGTGACTTTACAAGTGACAGATAAGGTGCATGTTGAGAAAAATGGGACAACT 

TTTAACACCAAAACTTGAAGAACAACAACATGGCAAGGTTG^CTC^GTGACCAGCA^TCGTCAA 

TACGGACGACAAAGACCATGAACTTGTAGCCGAATATCAAATCGTTG^CGAGGTGGTCATGCTGTAAC 

rrTTAGTT CGTACAGCGAGTCGTACCTTAAAAGCACATGAATCAACAAGCCTAG 

ag?tgIScc^^ 

ttaCCGTGACGGTCAATTGGTTGATGCTAAGAAGGATTTGTTTGGTTACCGTTACTATCACTGGACTCC 

AAATGAAGGT-TCTCTTTGAATGGTGAACGTATTAAATTCCATGGAGTATCCTTGCACCACGACCAT 

^g?SggIgcagSgaaaactataaagcagaatatcgccgtctcwcaaatgaaggagatgggagt 

"AACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAAACCTTGCAAATCGCAGCAGAACTAGGT 

‘ TTCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCAAGAAACCTTATGACTATGGACGTT - 

W-aaaaAGATGCCACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGATTTTGACCTACGTA 

^^^^^S^ISrS!vC^A^TTCATGTCGTCAATTGGTAATt^TAGGTGAAOT 

^^GSc^AKScSSi^SSSGTTAAGGrrATCAAGGATGTlGATAAGACTCG 

TTGGGGGAAAACAGCAACCGCTTCATGGACTTTrGACCGTGACAACGC.OTCTA-GCTG^ACAG^^ 

rTGGACAGGTACGGACTATATTGGTGAACCTAC ACCATGGC ACAACC AAAATC AAA * 

CTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCTCTACCAAA 

ii==K=rA= 

ALGAEENYKAEYSRLKQMKEMGVNSIRTTHNPASEQTLQIAAELGLLVQEEAcDTWYG - 

EKDATHPEARKGEKWSDFDLRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLA VKV 

YVTMGADKFRFGNGSGGHEKXADELDAVGFNYSEDOTKALRAKHP^LIYGScTSSAT^GSYYR^ 

ELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFDRDNAGYAGQFIWTGTD-IGc. 

syfgivdtagipkhdfylyqs 

ATTCACCCTGACTGCCCACTCTGATCTCTTGAAATCGAACCAAGTCACTGTCTTaACTGGIAA 


aggacaagagaagactgttttggggacagaagtgc 


CAAAAGTACAGACCATTATTGGAGAGGCACCTGA 
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AATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAACCTGGTCTTC 

AGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGAAGCTCG 

TGTAGAAGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTGACTTGAA 

TTCTGTAGACAAATCTGTTTCCTATGTTTTGATTGATGGAAGTGTTGAAGAGTATGAAGTGGACAAGTG 

GGAGATTGCCGAAGAAGATAAAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGTTATTT 

AGAAGGTCAACCAATTCATGCAACCCTTGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTACCAAC 

TGTAACGGTTGGTGGTGAGGCAGTAACAGGTCTTACTAGTCAAAAACCAATGCAATACCGCACTCTTGC 

TTATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCTAAAAATGCAGCTGTTACAGTTCTTCAAGCAAG 

CGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGATGGTGGCCCTCTTCAAACCTATGC 

AATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAGCTTGCAAGTGGAAAAAGCTGACAGTCT 

CAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAAGCTGTATTACC 

AGCTGATAAAGTAACCTTCTCTACAAGTGGTGAAGGGGAAGTCGCAATTCGTAAAGGAATGCTTGAGTT 

GCATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGAACTCACTAT 

CCAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCA 

GGAACCAAGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCAC 

TTGGCAAGCTATTCCGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGG 

AATTGACCTTGAAGCGCGTGCAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGAC 

AACTCCAATCGCAGAAGCACCACAATTACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTC 

ATCAGCTAAGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAA 

TGGTCGCTTAGAAGGTACGCAATTAACA 

SP052 amino acid (SEQ ID NO: 84) 

YFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNWENKELASKVADSEGKIPVRAYSNASSVELr 

LNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDESGKEIARDKITTAGKPAA 

VRLIKEDHAIAADGKDLTYIYYEIVDSQGNWPTANNLVRFQLHGQGQLVGVDNGEQASRERYKAQADG 

swirkafngkgvaivksteqagkftltahsdLlxsnqvtvftgkkegqektvlgtevpkvqtiigeape 

MPTTVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREVEARVEVIALKSELPWKRIAPNTDLN 
SVDKSVSYVLIDGSVEEYEVDKWEIAEEDKAKLAIPGSRIQATGYLEGQPIHATLWEEGNPAAPAVPT 
VTVGGEAVTGLTS QK PMQ YRTLA YGAKL P EVTA S AKNAA VTVLQAS AANGMRAS I F I QP KDGG PLQT YA 
IQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSVRAHYQDGTQAVLPADKVTFSTSGEGEVAIRKGMLEL 
HKPGAVTLNAEYEGAKDQVELTIQANTEKKIAQSIRPVNWTDLHQEPSLPATVTVEYDKGFPKTHKVT 
WQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTPIAEAPQLPESVRTYDSNGHVS 
SAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLT 

SPO 5 3 nucleotide <SEQ ID NO: 85) 

AGCTAAGGTTGCATGGGATGCGATTCGTCCAGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGG 

TCGCTTAGAAGGTACGCAATTAACAACTAAACTTCATGTTCGCGTATCTGCTCAAACTGAGCAAGGTGC 

AAACATTTCTGACCAATGGACCGGTTCAGAATTGCCACTTGCCTTTGCTTCAGACTCAAATCCAAGCGA 

CCCAGTTTCAAATGTTAATGACAAGCTCATTTCCTACAATAACCAACCAGCCAATCGTTGGACAAACTG 

GAATCGTACTAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATCTTGAGCAAACGCTC 

CGTTGATAATCTAAGTGTCGGATTCCATGAAGACCATGGAGTTGGTGTACCGAAGTCTTATGTGATTGA 

GTATTATGTTGGTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGT 

CTTTAATGATTCTGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTGCTCAACTCAAGGCTGGAGA 

AATGAACCACTTTAGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAA 

CAAGCGTGGAACGTCTATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACA 

AACAAGAATCCAAGTTGACGGCAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGA 

GTCTGTAGATGGAAAAGTTCCGGCAGTCACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCC 

AAGCGTTCGTGAAGGTGAGCCAGTTCGTGTCATCGCGAAAGCTGAAAATGGCGACATCTTAGGAGAATA 

CCGTCTGCACTTCACTAAGGATAAGAGCTTACTTTCTCATAAACCAGTTGCTGCGGTTAAACAAGCTCG 

CTTGCTACAAGTAGGTCAAGCACTTGAATTGCCGACTAAGGTTCCAGTTTACTTCACAGGTAAAGACGG 

CTACGAAACAAAAGACCTGACAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCTGACAAAAGCAGGTCA 

ATTTACTGTTCGAGGCCGTGTCCTTGGTAGTAACCTTGTTGCTGAGATCACTGTACGAGTGACAGACAA 

ACTTGGTGAGACTCTTTCAGATAACCCTAACTATGATGAAAACAGTAACCAGGCCTTTGCTTCAGCAAC 

CAATGATATTGACAAAAACTCTCATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCG 

TCGTTGGACAAACTGGTCACCAACACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGA 

AAATGGTAAGATTGTAGAACGGACTGTTACACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGA 

TGCACCATCTAAACTCGTTTTAGAACGCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAA 

CTACCAAGCCTACGACGCAGACCATCCATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGC 
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GGATAAAGACATTGCAGCTGGTGATGAAATCAACGTAACATTTAAAGCTATCAAAGCCAAAGCTATGAG 

ATGGCGTATGGAGCGTAAAGCAGATAAGAGCGGTGTTGCGATGATTGAGATGACCTTCCTTGCACCAAG 

TGAATTGCCTCAAGAAAGCACTCAATCAAAGATTCTTGTAGATGGAAAAGAACTTGCTGATTTCGCTGA 

AAATCGTCAAGACTATCAAATTACCTATAAAGGTCAACGGCCAAAAGTCTCAGTTGAAGAAAACAATCA 

AGTAGCTTCAACTGTGGTAGATAGTGGAGAAGATAGCTTTCCAGTACTTGTTCGCCTCGTTTCAGAAAG 

TGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAAAAACCAGTTTCTGAGAAGACAGTTGC 

TGCTGTACAAGAAGATCTTCCAAAAATCGAATTTGTTGAAAAAGATTTGGCATACAAGACAGTTGAGAA 

AAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTAGAACAAGAAGGAAAAGTTGGAAAAGAACGTAT 

CTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGAAAAACTCCGTGAAGTGGTAGAAGTTCCGACAGA 

CCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAAGAAGCTAAAAAACCACAAGTGTCAGAAAAAGC 

AGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAACTAATAAAGCCCAG 

SP053 amino acid (SEQ ID NO: 86) 

AKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANISDQWTGSELPLAFASDSNPSD 
PVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDNLSVGFHEDHGVGVPKSYVIE 
YYVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHFSFDKVETYAVRIRMVKADN 
KRGTSITEVQIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGKVPAVTASVSNNGLATWP 
SVREGEPVRVIAKAENGDILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQALELPTKVPVYFTGKDG 
YETKDLTVEWEEVPAENLTKAGQFTVRGRVLGSNLVAEITVRVTDKLGETLSDNPNYDENSNQAFASAT 
NDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNPEVSAGVIFRENGKIVERTVTQGKVQFFADSGTD 
APSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYRADKDIAAGDEINVTFKAIKAKAMR 
WRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQITYKGQRPKVSVEENNQ 
VASTVVDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKIEFVEKDLAYKTVEK 
KDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREWEVPTDRIVLVGTKPVAQEAKKPQVSEKA 
DTKP IDS S EASQTNKAQ 

SP054 nucleotide (SEQ ID NO: 87) 

CTATCACTATGTAAATAAAGAGATTATTTCACAAGAAGCTAAAGATTTAATTCAGACAGGAAAGCCTGA 

CAGGAATGAAGTTGTATATGGTTTGGTGTATCAAAAAGATCAGTTGCCTCAAACAGGGACAGAA 

SP054 amino acid (SEQ ID NO: 88) 

YHYVNKE 1 1 SQEAKDL I QTGKPDRNEWYGLVYQKDQL PQTGTE 

SP055 nucleotide (SEQ ID NO:89) 

TGAGACTCCTCAATCAATAACAAATCAGGAGCAAGCTAGGACAGAAAACCAAGTAGTAGAGACAGAGGA 

AGCTCCAAAAGAAGAAGCACCTAAAACAGAAGAAAGTCCAAAGGAAGAACCAAAATCGGAGGTAAAACC 

TACTGACGACACCCTTCCTAAAGTAGAAGAGGGGAAAGAAGATTCAGCAGAACCAGCTCCAGTTGAAGA 

AGTAGGTGGAGAAGTTGAGTCAAAACCAGAGGAAAAAGTAGCAGTTAAGCCAGAAAGTCAACCATCAGA 

CAAACCAGCTGAGGAATCAAAAGTTGAACAAGCAGGTGAACCAGTCGCGCCAAGAGAAGACGAAAAGGC 

ACCAGTCGAGCCAGAAAAGCAACCAGAAGCTCCTGAAGAAGAGAAGGCTGTAGAGGAAACACCGAAACA 

AGAAGAGTCAACTCCAGATACCAAGGCTGAAGAAACTGTAGAACCAAAAGAGGAGACTGTTAATCAATC 

TATTGAACAACCAAAAGTTGAAACGCCTGCTGTAGAAAAACAAACAGAACCAACAGAGGAACCAAAAGT 

TGAACAAGCAGGTGAACCAGTCGCGCCAAGAGAAGACGAACAGGCACCAACGGCACCAGTTGAGCCAGA 

AAAGCAACCAGAAGTTCCTGAAGAAGAGAAGGCTGTAGAGGAAACACCGAAACCAGAAGATAAAATAAA 

GGGTATTGGTACTAAAGAACCAGTTGATAAAAGTGAGTTAAATAATCAAATTGATAAAGCTAGTTCAGT 

TTCTCCTACTGATTAT 

SP055 amino acid (SEQ ID NO: 90) 

ETPQSITNQEQARTENQWETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPKVEEGKEDSAEPAPVEE 

VGGEVESKPEEKVAVKPESQPSDKPAEESKVEQAGEPVAPREDEKAPVEPEKQPEAPEEEKAVEETPKQ 

EESTPDTKAEETVEPKEETVNQSIEQPKVETPAVEKQTEPTEEPKVEQAGEPVAPREDEQAPTAPVEPE 

KQPEVPEEEKAVEETPKPEDKIKGIGTKEPVDKSELNNQIDKASSVSPTDY 

SP056 nucleotide (SEQ ID NO: 91) 

GGATGCTCAAGAAACTGCGGGAGTTCACTATAAATATGTGGCAGATTCAGAGCTATCATCAGAAGAAAA 
GAAGC AGCTTGTCTATGAT ATTC CG AC ATACGTGG AGAATGATG ATGAAACTT ATTATCTTGTTTATAA 
GTTAAATTCTCAAAATCAACTGGCGGAATTGCCAAATACTGGAAGCAAGAATGAGAGGCAA 
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SP056 amino acid (SEQ ID NO:92) 

DAQETAGVHYKYVADS ELS S EEKKQLVYD I PTYVENDDETYYLVYKLNSQNQ LAELPNTG S KNERQ 

SP057 nucleotide (SEQ ID NO: 93) 

CGACAAAGGTGAGACTGAGGTTCAACCAGAGTCGCCAGATACTGTGGTAAGTGATAAAGGTGAACCAGA 

GCAGGTAGCACCGCTTCCAGAATATAAGGGTAATATTGAGCAAGTAAAACCTGAAACTCCGGTTGAGAA 

GACCAAAGAACAAGGTCCAGAAAAAACTGAAGAAGTTCCAGTAAAACCAACAGAAGAAACACCAGTAAA 

TCCAAATGAAGGTACTACAGAAGGAACCTCAATTCAAGAAGCAGAAAATCCAGTTCAACCTGCAGAAGA 

ATCAACAACGAATTCAGAGAAAGTATCACCAGATACATCTAGCAAAAATACTGGGGAAGTGTCCAGTAA 

TCCTAGTGATTCGACAACCTCAGTTGGAGAATCAAATAAACCAGAACATAATGACTCTAAAAATGAAAA 

TTCAGAAAAAACTGTAGAAGAAGTTCCAGTAAATCCAAATGAAGGCACAGTAGAAGGTACCTCAAATCA 

AGAAACAGAAAAACCAGTTCAACCTGCAGAAGAAACACAAACAAACTCTGGGAAAATAGCTAACGAAAA 

TACTGGAGAAGTATCCAATAAACCTAGTGATTCAAAACCACCAGTTGAAGAATCAAATCAACCAGAAAA 

AAACGGAACTGCAACAAAACCAGAAAATTCAGGTAATACAACATCAGAGAATGGACAAACAGAACCAGA 

ACCATCAAACGGAAATTCAACTGAGGATGTTTCAACCGAATCAAACACATCCAATTCAAATGGAAACGA 

AGAAATTAAACAAGAAAATGAACTAGACCCTGATAAAAAGGTAGAAGAACCAGAGAAAACACTTGAATT 

AAGAAAT 

SP057 amino acid (SEQ ID NO: 94) 

DKGETEVQPESPDTWSDKGEPEQVAPLPEYKGNIEQVKPETPVEKTKEQGPEKTEEVPVKPTEETPVN 

PNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSKNTGEVSSNPSDSTTSVGESNKPEHNDSKNEN 

SEKTVEEVPVNPNEGTVEGTSNQETEKPVQPAEETQTNSGKIANENTGEVSNKPSDSKPPVEESNQPEK 

NGTATKPENSGNTTSENGQTEPEPSNGNSTEDVSTESNTSNSNGNEEIKQENELDPDKKVEEPEKTLEL 

RN 

/ 

SP058 nucleotide (SEQ ID NO:95) 

AAATCAATTGGTAGCACAAGA7CCAAAAGCACAAGATAGCACTAAACTGACTGCTGAAAAATCAACTGT 
TAAAGCACCTGCTCAAAGAGTAGATGTAAAAGATATAACTCATTTAACAGATGAAGAAAAAGTTAAGGT 
TGCTATTTTACAAGCAAATGGTTCAGCATTAGACGGAGCGACAATCAATGTAGCTGGAGATGGTACAGC 
AACAATCACATTCCCAGATGGTTCAGTAGTGACGATTCTAGGAAAAGATACAGTTCAACAATCTGCGAA 
AGGTGAATCTGTAACTC AAGAAGC TACACC AGAGTATAAGCT AGAAAAT AC AC CAGGTGGAGATAAGGG 
AGGCAATACTGGAAGCTCAGATGCTAATGCGAATGAAGGCGGTGGTAGCCAGGCGGGTGGATCAGCTCA 
CACAGGTTCACAAAACTCAGCTCAATCACAAGCTTCTAAGCAATTAGCTACTGAAAAAGAATCAGCTAA 
AAATGCCATTGAAAAAGCAGCCAAGGACAAGCAGGATGAAATCAAAGGCGCACCGCTTTCTGATAAAGA 
AAAAGCAGAACTTTTAGCAAGAGTGGAAGCAGAAAAACAAGCAGCTCTCAAAGAGATTGAAAATGCGAA 
AACTATGGAAGATGTGAAGGAAGCAGAAACGATTGGAGTGCAAGCCATTGCCATGGTTACAGTTCCTAA 
GAGACCAGTGGCTCCTAAT 

SP058 amino acid (SEQ ID NO:96) 

NQLVAQDPKAQDSTKLTAEKSTVKAPAQRVDVKDITHLTDEEKVKVAILQANGSALDGATINVAGDGTA 

TITFPDGSWTILGKDTVQQSAKGESVTQEATPEYKLENTPGGDKGGNTGSSDANANEGGGSQAGGSAH 

TGSQNSAQSQASKQLATEKESAKNAIEKAAKDKQDEIKGAPLSDKEKAELLARVEAEKQAALKEIENAK 

TMEDVKEAETIGVQAIAMVTVPKRPVAPN 

SP059 nucleotide (SEQ ID NO: 97) 

CAAACAGTCAGCTTCAGGAACGATTGAGGTGATTTCACGAGAAAATGGCTCTGGGACACGGGGTGCCTT 

CACAGAAATCACAGGGATTCTCAAAAAAGACGGTGATAAAAAAATTGACAACACTGCCAAAACAGCTGT 

GATTCAAAATAGTACAGAAGGTGTTCTCTCAGCAGTTCAAGGGAATGCTAATGCTATCGGCTACATCTC 

CTTGGGATCTTTAACGAAATCTGTCAAGGCTTTAGAGATTGATGGTGTCAAGGCTAGTCGAGACACAGT 

TTTAGATGGTGAATACCCTCTTCAACGTCCCTTCAACATTGTTTGGTCTTCTAATCTTTCCAAGCTAGG 

TCAAGATTTTATCAGCTTTATCCACTCCAAACAAGGTCAACAAGTGGTCACAGATAATAAATTTATTGA 

AGCTAAAACCGAAACCACGGAATATACAAGCCAACACTTATCAGGCAAGTTGTCTGTTGTAGGTTCCAC 

TTCAGTATCTTCTTTAATGGAAAAATTAGCAGAAGCTTATAAAAAAGAAAATCCAGAAGTTACGATTGA 

TATTACCTCTAATGGGTCTTCAGCAGGTATTACCGCTGTTAAGGAGAAAACCGCTGATATTGGTATGGT 

TTCTAGGGAATTAACTCCTGAAGAAGGTAAGAGTCTCACCCATGATGCTATTGCTTTAGACGGTATTGC 

TGTTGTGGTCAATAATGACAATAAGGCAAGCCAAGTCAGTATGGCTGAACTTGCAGACGTTTTTAGTGG 

CAAATTAACCACCTGGGACAAGATTAAA 
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SP0 5 9 amino acid (SEQ ID NO: 98) 

KQSASGTIEVISRENGSGTRGAFTEITGILKKDGDKKIDNTAKTAVIQNSTEGVLSAVQGNANAIGYIS 

LGSLTKSVKALEXDGVKASRDTVLDGEYPLQRPFNIVWSSMLSKLGQDFISFIHSKQGQQWTDNKFIE 

AKTETTEYTSQHLSGKLSWGSTSVSSLMEKLAEAYKKENPEVTIDITSNGSSAGITAVKEKTADIGMV 

SRELTPEEGKSLTHDAIALDGIAWVNNDNKASQVSMAELADVFSGKLTTWDKIK 

SP060 nucleotide (SEQ ID NO: 99) 

ATTCGATGATGCGGATGAAAAGATGACCCGTGATGAAATTGCCTATATGCTGACAAATAGTGAAGAAAC 

ATTGGATGCTGATGAGATTGAGATGCTACAAGGTGTCTTTTCGCTCGATGAACTGATGGCACGAGAGGT 

TATGGTTCCTCGAACGGATGCCTTTATGGTGGATATTCAGGATGATAGTCAAGCCATTATCCAAAGTAT 

TTTAAAACAAAATTATTCTCGTATCCCGGTTTATGATGGGGATAAGGACAATGTAATTGGAATCATTCA 

CACCAAGAGTCTCCTTAAGGCAGGCTTTGTGGACGGTTTTGACAATATTGTTTGGAAGAGAATTTTACA 

AGATCCACTTTTTGTACCTGAAACTATTTTTGTGGATGACTTGCTAAAAGAACTGCGAAATACCCAAAG 

ACAAATG 

SP060 amino acid (SEQ ID NO: 100) 

FDDADEKMTRDEIAYMLTNSEETLDADEIEMLQGVFSLDELMAREVMVPRTDAFMVDIQDDSQAIIQSI 

LKQNYSRIPVYDGDKDNVIGIIHTKSLLKAGFVDGFDNIVWKRILQDPLFVPETIFVDDLLKELRNTQR 

QM 

SP062 nucleotide (SEQ ID NO: 101) 

GGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCAAGTTC 
AGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAGCCGACAGAACCAGGAGAAAA 
GGTAGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCG 
TCGTAACTACCCAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAA 
AAAAGCGGAGC TTGAACTAGT AAAAGTGAAAGCTAACGAACC TCGAGAC G AGC AA 

SP062 amino acid (SEQ ID NO:102> 

ESRSKVDEAVSKFEKDSSSSSSSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKDQKEEDR 

RNYPTITYKTLELEIAESDVEVKKAELELVKVKANEPRDEQ 

SP063 nucleotide (SEQ ID NO;103) 

ATGGACAACAGGAAACTGGGACGAGGTTATATCTGGTAAGATTGACAAGTACAAAGATCCAGATATTCC 
AACAGTTGAATCACAAGAAGTTACGTCAGACTCTAGTGATAAAGAAATAACGGTAAGGTATGACCGTTT 
ATCAACACCAGAAAAACCAATCCCACAACCAAATCCAGAGCATCCAAGTGTTCCGACACCAAACCCAGA 
ACTACCAAATCAAGAGACTCCAACACCAGATAAACCAACTCCAGAACCAGGTACTCCAAAAACTGAAAC 
TCCAGTGAATCCAGACCCAGAAGTTCCGACTTATGAGACAGGTAAGAGAGAGGAATTGCCAAACACAGG 
TAC AGAAGC T AAT 

SP063 amino acid (SEQ ID NO: 104) 

WTTGNWDEVISGKIDKYKDPDIPTVESQEVTSDSSDKEITVRYDRLSTPEKPIPQPNPEHPSVPTPNPE 

LPNQETPTPDKPTPEPGTPKTETPVNPDPEVPTYETGKREELPNTGTEAN 

SP064 nucleotide (SEQ ID NO: 105) 

CGATGGGCTCAATCCAACCCCAGGTCAAGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACTT 

ATCAGAAAAACCAGGAGACACCGTTCTCACTCAAGCGAAACCTGAGGGCGTTACTGGAAATACGAATTC 

ACTTCCGACACCTACAGAAAGAACTGAAGTGAGCGAGGAAACAAGCCCTTCTAGTCTGGATACACTTTT 

TGAAAAAGATGAAGAAGCTCAAAAAAATCCAGAGCTAACAGATGTCTTAAAAGAAACTGTAGATACAGC 

TGATGTGGATGGGACACAAGCAAGTCCAGCAGAAACTACTCCTGAACAAGTAAAAGGTGGAGTGAAAGA 

AAATACAAAAGACAGCATCGATGTTCCTGCTGCTTATCTTGAAAAAGCTGAAGGGAAAGGTCCTTTCAC 

TGCCGGTGTAAACCAAGTAATTCCTTATGAACTATTCGCTGGTGATGGTATGTTAACTCGTCTATTACT 

AAAAGCTTCGGATAATGCTCCTTGGTCTGACAATGGTACTGCTAAAAATCCTGCTTTACCTCCTCTTGA 

AGGATTAACAAAAGGGAAATACTTCTATGAAGTAGACTTAAATGGCAATACTGTTGGTAAACAAGGTCA 

AGCTTTAATTGATCAACTTCGCGCTAATGGTACTCAAACTTATAAAGCTACTGTTAAAGTTTACGGAAA 

TAAAGACGGTAAAGCTGACTTGACTAATCTAGTTGCTACTAAAAATGTAGACATCAACATCAATGGATT 

AGTTGCTAAAGAAACAGTTCAAAAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGC 

CTACCTAGAAAAAGCCAAGGGTGAAGGTCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACT 

CTTCGCAGGTGATGGCATGTTGACTCGTCTCTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAA 
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CGGCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAAACGTGAAGACCAAAGGTCAATACTTCTA 

TCAANTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACAAGCGCTCATTGACCAGTTCCGAGCAAA 

NGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGACGGTAAACCAGACTTGGACAA 

CATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATTTCTAAAGAAACAGTTCAAAAAGC 

CGTTGCAGACAACGTTAANGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTGAAGG 

TCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGTATGTTGACTCG 

TCTCTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGNGACGCTAAAAACCCAGCNCTATC 

TCCACTAGGTGAAAACGTGAAGACCAAAGGTCAATACTTCTATCAANTAGCCTTGGACGGAAATGTAGC 

TGGCAAAGAAAAACAAGCGCTCATTGACCAGTTCCGAGCAAACGGTACTCAAACTTACAGCGCTACAGT 

CAATGTCTATGGTAACAAAGACGGTAAACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTAT 

TAAGATAAATGTTAAAGAAACATCAGACACAGCAAATGGTTCATTATCACCTTCTAACTCTGGTTCTGG 

CGTGACTCCGATGAATCACAATCATGCTACAGGTACTACAGATAGCATGCCTGCTGACACCATGACAAG 

TTCTACCAACACGATGGCAGGTGAAAACATGGCTGCTTCTGCTAACAAGATGTCTGATACGATGATGTC 

AGAGGATAAAGCTATG 

SP064 amino acid {SEQ ID NO: 106) 

DGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKPEGVTGNTNSLPTPTERTEVSEETSPSSLDTLF 

EKDEEAQKNPELTDVLKETVDTADVDGTQASPAETTPEQVKGGVKENTKDSIDVPAAYLEKAEGKGPFT 

AGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYEVDLNGNTVGKQGQ 

ALIDQLRANGTQTYKATVKVYGNKDGKADLTNLVATKNVDININGLVAKETVQKAVADNVKDSIDVPAA 

YLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFY 

QXALDGNVAGKEKQALIDQFRAXGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKA 

VADNVXDSIDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALS 

PLGENVKTKGQYFYQXALDGNVAGKEKQALIDQFRANGTQTYSATVWYGNKDGKPDLDNIVATKKVTI 

KINVKETSDTANGSLSPSNSGSGVTPMNHNHATGTTDSMPADTMTSSTNTMAGENMAASANKMSDTMMS 

EDKAM 

SP065 nucleotide (SEQ ID NO:107> 

TTCCAATCAAAAACAGGCAGATGGTAAACTCAATATCGTGACAACCTTTTACCCTGTCTATGArTTTAC 

CAAGCAAGTCGCAGGAGATACGGCTAATGTAGAACTCCTAATCGGTGCTGGGACAGAACCTCATGAATA 

CGAACCATCTGCCAAGGCAGTTGCCAAAATCCAAGATGCAGATACCTTCGTTTATGAAAATGAAAACAT 

GGAAACATGGGTACCTAAATTGCTAGATACCTTGGATAAGAAAAAAGTGAAAACCATCAAGGCGACAGG 

CGATATGTTGCTCTTGCCAGGTGGCGAGGAAGAAGAGGGAGACCATGACCATGGAGAAGAAGGTCATCA 

CCATGAGTTTGACCCCCATGTTTGGTTATCACCAGTTCGTGCCATtAAACTAGTAGAGCACCATCCGCG 

ACACTTGTCAGCAGATTATCCTGATAAAAAAGAGACCTTTGAGAAGAATGCAGCTGCCTATATCGAAAA 

ATTGCAAGCCTTGGATAAGGCTTACGCAGAAGGTTTGTCTCAAGCAAAACAAAAGAGCTTTGTGACTCA 

ACACGCAgCCTTTAACTaTCTTGCCTTGGACTATGGGACTC 

SP065 amino acid <SEQ ID 110:108) 

SNQKQADGKLNIVTTFYPVTEFTKQVAGDTANVELLIGAGTEPHEYEPSAKAVAKIQDADTFVYENENM 

ETWVPKLLDTLDKKKVKTIKATGDMLLLPGGEEEEGDHDHGEEGHHHEFDPHVWLSPVRAIKLVEHHPR 

HLSADYPDKKETFEK^AAAYIEKLQALDKAYAEGLSQAKQKSFVTQHAAFNYLALDYGT 

SP067 nucleotide (SEQ ID NO: 109) 

TATCACAGGATCGAACGGTAAGACAACCACAACGACTATGATTGGGGAAGTTTTGACTGCTGCTGGCCA 

ACATGGTCTTTTATCAGGGAATATCGGCTATCCAGCTAGTCAGGTTGCTCAAATAGCATCAGATAAGGA 

CACGCTTGTTATGGAACTTTCTTCTTTCCAACTCATGGGTGTTCAAGAATTCCATCCAGAGATTGCGGT 

TATTACCAACCTCATGCCAACTCATATCGACTACCATGGGTCATTTTCGGAATATGTAGCAGCCAAGTG 

GAATATCCAGAACAAGATGACAGCAGCTGATTTCCTTGTCTTGAACTTTAATCAAGACTTGGCAAAAGA 

CTTGACTTCCAAGACAGAAGCCACTGTTGTACCATTTTCAACACTTGAAAAGGTTGATGGAGCTTATCT 

GGAAGATGGTCAACTCTACTTCCGTGGTGAAGTAGTCATGGCAGCGAATGAAATCGGTGTTCCAGGTAG 

CCACAATGTGGAAAATGCCCTTGCGACTATTGCTGTAGCCAAGCTTCGTGATGTGGACAATCAAACCAT 

CAAGGAAACTCTTTCAGCCTTCGdTGGTGTCAAACACCGTCTCCAGTTTGTGGATGACATCAAGGGTGT 

TAAATTCTATAACGACAGTAAATCAACTAATATCTTGGCTACTCAAAAAGCCTTGTCAGGATTTGACAA 

CAGCAAGGTCGTCTTGATTGCAGGTGGTTTGGACCGTGGCAATGAGTTTGACGAATTGGTGCCAGACAT 

TACTGGACTCAAGAAGATGGTCATCCTGGGTCAATCTGCAGAACGTGTCAAACGGGCAGCAGACAAGGC 

TGGTGTCGCTTATGTGGAGGCGACAGATATTGCAGATGCGACCCGCAAGGCCTATGAGCTTGCGACTCA 
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AGGAGATGTGGTTCTTCTTAGTCCTGCCAATGCTAGCTGGGATATGTATGCTAACTTTGAAGTACGTGG 

CGACCTCTTTATCGACACAGTAGCGGAGTTAAAAGAA 

SP067 amino acid (SEQ ID NO: 110) 

GITGSNGKTTTTTMIGEVLTAAGQHGLLSGNIGYPASGVAQIASDKDTLVMELSSFQLMGVQEFHPEIA 
VITNLMPTHIDYHGSFSEYVAAKWNIQNKMTAADFLVLNFNQDLAKDLTSKTEATWPFSTLEKVDGAY 
LEDGQ L YFRG EWMAANEIG VPGS HNVENALAT I AVAKLRDVDNQT I KETLSAFGGVKHRLQFVDD I KG 
VKFYNDSKSTNILATQKALSGFDNSKVVLIAGGLDRGNEFDELVPDITGLKKMVILGQSAERVKRAADK 
AGVAYVEATD I ADATRKA YELATQGDWLLS P ANASWDMYANF EVRGDL F I DTVAELKE 

SP068 nucleotide (SEQ ID NO: 111) 

AAGTTCATCGAAGATGGTTGGGAAGTCCACTATATCGGGGACAAGTGTGGTATCGAACACCAAGAAATC 

CTTAAGTCAGGTTTGGATGTCACCTTCCATTCTATTGCGACTGGAAAATTGCGTCGCTATTTCTCTTGG 

CAAAATATGCTGGACGTCTTCAAAGTTGGTTGGGGAATTGTCCAATCGCTCTTTATCATGTTGCGACTG 

CGTCCACAGACCCTTTTTTCAAAGGGGGGCTTTGTCTCAGTACCGCCTGTTATCGCTGCGCGTGTGTCA 

GGAGTGCCTGTCTTTATTCACGAATCTGACCTGTCTATGGGCTTGGCCAATAAAATCGCCTATAAATTT 

GCGACTAAGATGTATTCAACCTTTGAACAAGCTTCGAGTTTGGCTAAGGTTGAGCATGTGGGAGCGG 

SPO 68 amino acid (SEQ ID NO: 112) 

SSSKMVGKSTISGTSWSNTKKSLSQVWMSPSILLRLENCVAISLGKICWTSSKLVGELSNRSLSCCDC 

VHRPFFQRGALSQYRLLSLRVCQECLSLFTNLTCLWAWPIKSPINLRLRCIQPLNKLRVWLRLSMWER 

SP069 nucleotide (SEQ ID NO: 113) 

ATCGCTAGCTAGTGAAATGCAAGAAAGTACACGTAAATTCAAGGTTACTGCTGACCTAACAGATGCCGG 

TGTTGGAACGATTGAAGTTCCTTTGAGCATTGAAGATTTACCCAATGGGCTGACCGCTGTGGCGACTCC 

GCAAAAAATTACAGTCAAGATTGGTAAGAAGGCTCAGAAGGATAAGGTAAAGATTGTACCAGAGATTGA 

CCCTAGTCAAATTGATAGTCGGGTACAAATTGAAAATGTCATGGTGTCAGATAAAGAAGTGTCTATTAC 

GAGTGACCAAGAGACATTGGATAGAATTGATAAGATTATCGCTGTTTTGCCAACTAGCGAACGTATAAC 

AGGTAATTACAGTGGTTCAGTACCTTTGCAGGCAATCGACCGCAATGGTGTTGTCTTACCGGCAGTTAT 

CACTCCGTTTGATACAATAATGAAGGTGACTACAAAACCAGTAGCACCAAGTTCAAGCACATCAAATTC 

AAGTACAAGCAGTTCATCGGAGACATCTTCGTCAACGAAAGCAACTAGTTCAAAAACGAAT 

SPO 69 amino acid (SEQ ID NO:114) 

SLASEMQESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKXGKKAQKDKVKIVPEXD 

PSQIDSRVQIENVMVSDKEVSITSDQETLDRIDKIIAVLPTSERITGNYSGSVPLQAIDRNGWLPAVI 

TPFDTIMKVTTKPVAPSSSTSNSSTSSSSETSSSTKATSSKTN 

SP070 nucleotide (SEQ ID NO: 115) 

GCACCAGATGGGGCACAAGGTTCAGGGATCAGATGTTGAAAAGTACTACTTTACCCAACGCGGTCTTGA 

GCAGGCAGGAATTACCATTCTTCCTTTTGATGAAAAAAATCTAGACGGTGATATGGAAATTATCGCTGG 

AAATGCCTTTCG'TCCAGATAACAACGTCGAAATTGCCTATGCGGACCAAAATGGTATCAGCTACAAACG 

TTACCATGAGTTTCTAGGTAGCTTTATGCGTGACTTTGTTAGCATGGGAGTAGCAGGAGCACATGGAAA 

AACTTCAACGACAGGTATGTTGTCTCATGTCTTGTCTCACATTACAGATACCAGCTTCTTGATTGGAGA 

TGGGACAGGTCGTGGTTCGGCCAATGCCAAATATTTTGTCTTTGAATCTGACGAATATGAGCGTCACTT 

CATGCCTTACCACCCAGAATACTCTATTATCACCAACATTGACTTTGACCATCCAGATTATTTCACAAG 

TCTCGAGGATGTTTTTAATGCCTTTAACGACTATGCCAAACAAATCACCAAGGGTCTTTTTGTCTATGG 

TGAAGATGCTGAATTGCGTAAGATTACGTCTGATGCACCAATTTATTATTATGGTTTTGAAGCTGAAGG 

CAATGACTTTGTAGCTAGTGATCTTCTTCGTTCAATAACTGGTTCAACCTTCACCGTTCATTTCCGTGG 

ACAAAACTTGGGGCAATTCCACATTCCAACCTTTGGTCGTCACAATATCATGAATGCGACAGCCGTTAT 

TGGTCTTCTTTACACAGCAGGATTTGATTTGAACTTGGTGCGTGAGCACTTGAAAACATTTGCCGGTGT 

TAAACGTCGTTTCACTGAGAAAATTGTCAATGATACAGTGATTATCGATGACTTTGCCCACCATCCAAC 

AGAAATTATTGCGACCTTGGATGCGGCTCGTCAGAAATACCCAAGCAAGGAAATTGTAGCAGTCTTTCA 

ACCGCATACCTTTACAAGAACCATTGCCTTGTTGGACGACTTTGCCCATGCTTTAAACCAAGCAGATGC 

TGTTTATCTAGCGCAAATTTATGGCTCGGCTCGTGAAGTAGATCATGGTGACGTTAAGGTAGAAGACCT 

AGCCAACAAAATCAACAAAAAACACCAAGTGATTACTGTTGAAAATGTTTCTCCACTCCTAGACCATGA 

CAATGCTGTTTACGTCTTTATGGGAGCAGGAGACATCCAAACCTATGAATACTCATTTGAGCGTCTCTT 

GTCTAACTTGACAAGCAATGTTCAA 


BNSDOCID <WO 9B18930A2J_> 



WO 98/18930 

Table 1 


73 


PCT/US97/19422 


SP070 amino acid (SEQ ID NO;116) 

hqmghkvqgsdvekyyftqrgleqagitilpfdeknldgdmeiiagnafrpdnnveiayadqngisykr 

YHEFLGSFMRDFVSMGVAGAHGKTSTTGMLSHVLSHITDTSFLIGDGTGRGSANAKYFVFESDEYERHF 

MPYHPEYSIITNIDFDHPDYFTSLEDVFNAFNDYAKQITKGLFVYGEDAELRKITSDAPIYYYGFEAEG 

NDFVASDLLRSITGSTFTVHFRGQNLGQFHIPTFGRHNIMNATAVIGLLYTAGFDLNLVREHLKTFAGV 

KRRFTEKIVNDTVIIDDFAHHPTEIIATLDAARQKYPSKEIVAVFQPHTFTRTIALLDDFAHALNQADA 

VYLAQIYGSAREVDHGDVKVEDLANKINKKHQVITVENVSPLLDHDNAVYVFMGAGDIQTYEYSFERLL 

SNLTSNVQ 

SP071 nucleotide (SEQ ID NO;117) 

TTTTAACCCAACTGTTGGTACTTTCCTTTTTACTGCAGGATTGAGCTTGTTAGTTTTATTGGTTTCTAA 
AAGGGAAAATGGAAAGAAACGACTTGTTCATTTTCTGCTGTTGACTAGCATGGGAGTTCAATTGTTGCC 
GGCCAGTGCTTTTGGGTTGACCAGCCAGATTTTATCTGCCTATAATAGTCAGCTTTCTATCGGAGTCGG 
GGAACATTTACCAGAGCCTCTGAAAATCGAAGGTTATCAATATATTGGTTATATCAAAACTAAGAAACA 
GGATAATACAGAGCTTTCAAGGACAGTTGATGGGAAATACTCTGCTCAAAGAGATAGTCAACCAAACTC 
TACAAAAACATCAGATGTAGTTCATTCAGCTGATTTAGAATGGAACCAAGGACAGGGGAAGGTTAGTTT 
ACAAGGTGAAGCATCAGGGGATGATGGACTTTCAGAAAAATCTTCTATAGCAGCAGACAATCTATCTTC 
TAATGATTCATTCGCAAGTCAAGTTGAGCAGAATCCGGATCACAAAGGAGAATCTGTAGTTCGACCAAC 
AGTGCCAGAACAAGGAAATCCTGTGTCTGCTACAACGGTGCAGAGTGCGGAAGAGGAAGTATTGGCGAC 
GACAAATGATCGACCAGAGTATAAACTTCCATTGGAAACCAAAGGCACGCAAGAACCCGGTCATGAGGG 
TGAAGCCGCAGTCCGTGAAGACTTACCAGTCTACACTAAGCCACTAGAAACCAAAGGTACACAAGGACC 
CGGACATGAAGGTGAAGCTGCAGTTCGCGAGGAAGAACCAGCTTACACAGAACCGTTAGCAACGAAAGG 
CACGCAAGAGCCAGGTCATGAGGGCAAAGCTACAGTCCGCGAAGAGACTCTAGAGTACACGGAACCGGT 
AGCGACAAAAGGCACACAAGAACCCGAACATGAGGGCGAaCGGsCAGTAGAAGAAGAACTTCCGGCTTT 
AGAGGTCACTACACGAAATAGAACGGAAATCCAGAATATTCCTTATACAACAGAAGAAATTCAGGATCC 
AACACTTCTGAAAAATCGTCGTAAGATTGAACGACAAGGGCAAGCAGGGACACGTACAATTCAATATGA 
AGACTACATCGTAAATGGTAATGTCGTAGAAACTAAAGAAGTGTCACGAACTGAAGTAGCTCCGGTCAA 
CGAAGTCGTTAAAGTAGGAACACTTGTGAAAGTTAAACCTACAGTAGAAATTACAAACTTAACAAAAGT 
TGAGAACAAAAAATCTATAACTGTAAGTTATAACTTAATAGACACTACCTCAGCATATGTTTCTGCAAA 
AACGCAAGTTTTCCATGGAGACAAGCTAGTTAAAGAGGTGGATATAGAAAATCCTGCCAAAGAGCAAGT 
AATATCAGGTTTAGATTACTACACACCGTATACAGTTAAAACACACCTAACTTATAATTTGGGTGAAAA 
T AATGAGG AAAAT ACTGAAAC ATC AAC TC AAG ATTTCC AATT AG AGT AT AAGAAAAT AG AG AT T AAAG A 
TATTGATTCAGTAGAATTATACGGTAAAGAAAATGATCGTTATCGTAGATATTTAAGTCTAAGTGAAGC 
GCCGACTGATACGGCTAAATACTTTGTAAAAGTGAAATCAGATCGCTTCAAAGAAATGTACCTACCTGT 
AAAATCTATTACAGAAAATACGGATGGAACGTATAAAGTGACGGTAGCCGTTGATCAACTTGTCGAAGA 
AGGTACAGACGGTTACAAAGATGATTACACATTTACTGTAGCTAAATCTAAAGCAGAGCAACCAGGAGT 
TTACACATCCTTTAAACAGCTGGTAACAGCCATGCAAAGCAATCTGTCTGGTGTCTATACATTGGCTTC 
AGATATGACCGCAGATGAGGTGAGCTTAGGCGATAAGCAGACAAGTTATCTCACAGGTGCATTTACAGG 
GAGCTTGATCGGTTCTGATGGAACAAAATCGTATGCCATTTATGATTTGAAGAAACCATTATTTGATAC 
ATTAAATGGTGCTACAGTTAGAGATTTGGATATTAAAACTGTTTCTGCTGATAGTAAAGAAAATGTCGC 
AGCGCTGGCGAAGGCAGCGAATAGCGCGAATATTAATAATGTTGCAGTAGAAGGAAAAATCTCAGGTGC 
GAAATCTGTTGCGGGATTAGTAGCGAGCGCAACAAATACAGTGATAGAAAACAGCTCGTTTACAGGGAA 
ACTTATCGCAAATCACCAGGACAGTAATAAAAATGATACTGGAGGAATAGTAGGTAATATAACAGGAAA 
TAGTTCGAGAGTTAATAAAGTTAGGGTAGATGCCTTAATCTCTACTAATGCACGCAATAATAACCAAAC 
AGCTGGAGGGATAGTAGGTAGATTAGAAAATGGTGCATTGATATCTAATTCGGTTGCTACTGGAGAAAT 
ACGAAATGGTCAAGGATATTCTAGAGTCGGAGGAATAGTAGGATCTACGTGGCAAAACGGTCGAGTAAA 
TAATGTTGTGAGTAACGTAGATGTTGGAGATGGTTATGTTATCACCGGTGATCAATACGCAGCAGCAGA 
TGTGAAAAATGCAAGTACATCAGTTGATAATAGAAAAGCAGACAGATTCGCTACAAAATTATCAAAAGA 
CCAAATAGACGCGAAAGTTGCTGATTATGGAATCACAGTAACTCTTGATGATACTGGGCAAGATTTAAA 
ACGTAATCTAAGAGAAGTTGATTATACAAGACTAAATAAAGCAGAAGCTGAAAGAAAAGTAGCTTATAG 
CAACATAGAAAAACTGATGCCATTCTACAATAAAGACCTAGTAGTTCACTATGGTAACAAAGTAGCGAC 
AACAGATAAACTTTACACTACAGAATTGTTAGATGTTGTGCCGATGAAAGATGATGAAGTAGTAACGGA 
TATTAATAATAAGAAAAATTCAATAAATAAAGTTATGTTACATTTCAAAGATAATACAGTAGAATACCT 
AGATGTAACATTCAAAGAAAACTTCATAAACAGTCAAGTAATCGAATACAATGTTACAGGAAAAGAATA 
TATATTCACACCAGAAGCATTTGTTTCAGACTATACAGCGATAACGAATAACGTACTAAGCGACTTGCA 
AAATGTAACACTTAAC 

SP071 amino acid (SEQ ID NO: 118) 
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FNPTVGTFLFTAGLSLLVLLVSKRENGKKRLVHFLLLTSMGVQLLPASAFGLTSQILSAYNSQLSIGVG 

EHLPEPLKIEGYQYIGYIKTKKQDNTELSRTVDGKYSAQRDSQPNSTKTSDWHSADLEWNQGQGKVSL 

qgeasgddglsekssiaadnlssndsfasqveqnpdhkgeswrptvpeqgnpvsattvqsaeeevlat 

TNDRPEYKLPLETKGTQEPGHEGEAAVREDLPVYTKPLETKGTQGPGHEGEAAVREEEPAYTEPLATKG 
TQEPGHEGKATVREETLEYTEPVATKGTQEPEHEGERXVEEELPALEVTTRNRTEIQNIPYTTEEIQDP 
TLLKNRRKIERQGQAGTRTIQYEDYIVNGNWETKEVSRTEVAPVNEWKVGTLVKVKPTVEITNLTKV 
ENKKS ITVS YNL. IDTTS A YV SAKTQVFHGDKLVKEVDI ENP AKEQVI SGLDYYT P YTVKTHLT YNLGEN 
NEENTETSTQDFQLEYKKIEIKDIDSVELYGKENDRYRRYLSLSEAPTDTAKYFVKVKSDRFKEMYLPV 
KSXTENTDGTYKVTVAVDQLVEEGTDGYKDDYTFTVAKSKAEQPGVYTSFKQLVTAMQSNLSGVYTLAS 
DMTADEVSLGDKQTSYLTGAFTGSLIGSDGTKSYAIYDLKKPLFDTLNGATVRDLDIKTVSADSKENVA 
ALAKAANSANINNVAVEGKISGAKSVAGLVASATNTVIENSSFTGKLIANHQDSNKNDTGGIVGNITGN 
S S RVNKVRVDALI STNARNNNQTAGG IVGRLENGAL I SNS VATGEI RNGQGY SRVGG I VGSTWQNGRVN 
NWSNVDVGDGYVITGDQYAAADVKNASTSVDNRKADRFATKLSKDQIDAKVADYGITVTLDDTGQDLK 
RNLREVDYTRLNKAEAERKVAYSNIEKLMPFYNKDLWHYGNKVATTDKLYTTELLDWPMKDDEWTD 
INNKKNSINKVMLHFKDNTVEYLDVTFKENFINSQVIEYNVTGKEYIFTPEAFVSDYTAITNNVLSDLQ 
NVTLN 

SP072 nucleotide ( SEQ ID NO: 119) 

TTTTAACCCAACTGTTGGTACTTTCCTTTTTACTGCAGGATTGAGCTTGTTAGTTTTATTGGTTTCTAA 

AAGGGAAAATGGAAAGAAACGACTTGTTCATTTTCTGCTGTTGACTAGCATGGGAGTTCAATTGTTGCC 

GGCCAGTGCTTTTGGGTTGACCAGCCAGATTTTATCTGCCTATAATAGTCAGCTTTCTATCGGAGTCGG 

GGAACATTTACCAGAGCCTCTGAAAATCGAAGGTTATCAATATATTGGTTATATCAAAACTAAGAAACA 

GGATAATACAGAGCTTTCAAGGACAGTTGATGGGAAATACTCTGCTCAAAGAGATAGTCAACCAAACTC 

TACAAAAACATCAGATGTAGTTCATTCAGCTGATTTAGAATGGAACCAAGGACAGGGGAAGGTTAGTTT 

ACAAGGTGAAGCATCAGGGGATGATGGACTTTCAGAAAAATCTTCTATAGCAGCAGACAATCTATCTTC 

TAATGATTCATTCGCAAGTCAAGTTGAGCAGAATCCGGATCACAAAGGAGAATCTGTAGTTCGACCAAC 

AGTGCCAGAACAAGGAAATCCTGTGTCTGCTACAACGGTGCAGAGTGCGGAAGAGGAAGTATTGGCGAC 

GACAAATGATCGACCAGAGTATAAACTTCCATTGGAAACCAAAGGCACGCAAGAACCCGGTCATGAGGG 

TGAAGCCGCAGTCCGTGAAGACTTACCAGTCTACACTAAGCCACTAGAAACCAAAGGTACACAAGGACC 

CGGACATGAAGGTGAAGCTGCAGTTCGCGAGGAAGAACCAGCTTACACAGAACCGTTAGCAACGAAAGG 

CACGCAAGAGCCAGGTCATGAGGGCAAAGCTACAGTCCGCGAAGAGACTCTAGAGTACACGGAACCGGT 

AGCGACAAAAGGCACACAAGAACCCGAACATGAGGGCGAaCGGsCAGTAGAAGAAGAACTTCCGGCTTT 

AGAGGTCACTACACGAAATAGAACGGAAATCCAGAATATTCCTTATACAACAGAAGAAATTCAGGATCC 

AACACTTCTGAAAAATCGTCGTAAGATTGAACGACAAGGGCAAGCAGGGACACGTACAATTCAATATGA 

AGACTACATCGTAAATGGTAATGTCGTAGAAACTAAAGAAGTGTCACGAACTGAAGTAGCTCCGGTCAA 

CGAAGTCGTTAAAGTAGGAACACTTGTGAAAGTTAAACCTACAGTAGAAATTACAAACTTAACAAAAGT 

TGAGAACAAAAAATCTATAACTGTAAGTTATAACTTAATAGACACTACCTCAGCATATGTTTCTGCAAA 

AACGCAAGTTTTCCATGGAGACAAGCTAGTTAAAGAGGTGGATATAGAAAATCCTGCCAAAGAGCAAGT 

AATATCAGGTTTAGATTACTACACACCGTATACAGTTAAAACACACCTAACTTATAATTTGGGTGAAAA 

TAATGAGGAAAATACTGAAACATCAACTCAAGATTTCCAATTAGAGTATAAGAAAATAGAGATTAAAGA 

TATTGATTCAGTAGAATTATACGGTAAAGAAAATGATCGTTATCGTAGA 

SP072 amino acid (SEQ ID NO: 120) 

FNPTVGTFLFTAGLSLLVLLVSKRENGKKRLVHFLLLTSMGVQLLPASAFGLTSQILSAYNSQLSXGVG 

EHLPEPLKIEGYQYIGYIKTKKQDNTELSRTVDGKYSAQRDSQPNSTKTSDWHSADLEWNQGQGKVSL 

QGEASGDDGLSEKSSIAADNLSSNDSFASQVEQNPDHKGESWRPTVPEQGNPVSATTVQSAEEEVLAT 

TNDRPEYKLPLETKGTQEPGHEGEAAVREDLPVYTKPLETKGTQGPGHEGEAAVREEEPAYTEPLATKG 

TQEPGHEGKATVREETLEYTEPVATKGTQEPEHEGERXVEEELPALEVTTRNRTEIQNIPYTTEEIQDP 

TLLKNRRKIERQGQAGTRTXQYEDYIVNGNWETKEVSRTEVAPVNEWKVGTLVKVKPTVEITNLTKV 

ENKKSITVSYNLIDTTSAYVSAKTQVFHGDKLVKEVDIENPAKEQVISGLDYYTPYTVKTHLTYNLGEN 

NEENTETSTQDFQLEYKKIEIKDIDSVELYGKENDRYRR 

SP073 nucleotide (SEQ ID NO:121> 

TCGTAGATATTTAAGTCTAAGTGAAGCGCCGACTGATACGGCTAAATACTTTGTAAAAGTGAAATCAGA 

TCGCTTCAAAGAAATGTACCTACCTGTAAAATCTATTACAGAAAATACGGATGGAACGTATAAAGTGAC 

GGTAGCCGTTGATCAACTTGTCGAAGAAGGTACAGACGGTTACAAAGATGATTACACATTTACTGTAGC 

TAAATCTAAAGCAGAGCAACCAGGAGTTTACACATCCTTTAAACAGCTGGTAACAGCCATGCAAAGCAA 

TCTGTCTGGTGTCTATACATTGGCTTCAGATATGACCGCAGATGAGGTGAGCTTAGGCGATAAGCAGAC 
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AAGTTATCTCACAGGTGCATTTACAGGGAGCTTGATCGGTTCTGATGGAACAAAATCGTATGCCATTTA 

TGATTTGAAGAAACCATTATTTGATACATTAAATGGTGCTACAGTTAGAGATTTGGATATTAAAACTGT 

TTCTGCTGATAGTAAAGAAAATGTCGCAGCGCTGGCGAAGGCAGCGAATAGCGCGAATATTAATAATGT 

TGCAGTAGAAGGAAAAATCTCAGGTGCGAAATCTGTTGCGGGATTAGTAGCGAGCGCAACAAATACAGT 

GATAGAAAACAGCTCGTTTACAGGGAAACTTATCGCAAATCACCAGGACAGTAATAAAAATGATACTGG 

AGGAATAGTAGGTAATATAACAGGAAATAGTTCGAGAGTTAATAAAGTTAGGGTAGATGCCTTAATCTC 

TACTAATGCACGCAATAATAACCAAACAGCTGGAGGGATAGTAGGTAGATTAGAAAATGGTGCATTGAT 

ATCTAATTCGGTTGCTACTGGAGAAATACGAAATGGTCAAGGATATTCTAGAGTCGGAGGAATAGTAGG 

ATCTACGTGGCAAAACGGTCGAGTAAATAATGTTGTGAGTAACGTAGATGTTGGAGATGGTTATGTTAT 

CACCGGTGATCAATACGCAGCAGCAGATGTGAAAAATGCAAGTACATCAGTTGATAATAGAAAAGCAGA 

CAGATTCGCTACAAAATTATCAAAAGACCAAATAGACGCGAAAGTTGCTGATTATGGAATCACAGTAAC 

TCTTGATGATACTGGGCAAGATTTAAAACGTAATCTAAGAGAAGTTGATTATACAAGACTAAATAAAGC 

AGAAGCTGAAAGAAAAGTAGCTTATAGCAACATAGAAAAACTGATGCCATTCTACAATAAAGACCTAGT 

AGTTCACTATGGTAACAAAGTAGCGACAACAGATAAACTTTACACTACAGAATTGTTAGATGTTGTGCC 

GATGAAAGATGATGAAGTAGTAACGGATATTAATAATAAGAAAAATTCAATAAATAAAGTTATGTTACA 

TTTCAAAGATAATACAGTAGAATACCTAGATGTAACATTCAAAGAAAACTTCATAAACAGTCAAGTAAT 

CGAATACAATGTTACAGGAAAAGAATATATATTCACACCAGAAGCATTTGTTTCAGACTATACAGCGAT 

AACGAATAACGTACTAAGCGACTTGCAAAATGTAACACTTAAC 

SP073 amino acid (SEQ ID NO: 122) 

RR YL S L S EAPTDT AKY FVKVK S DRF KEMY L P VKS I TENTIX3T YKVTV A VDQ L VE EGTDG YKDD YTFTV A 
KSKAEQPGVYTSFKQLVTAMQSNLSGVYTLASDMTADEVSLGDKQTSYLTGAFTGSLIGSDGTKSYAIY 
DLKKPLFDTLNGATVRDLDIKTVSADSKENVAALAKAANSANINNVAVEGKISGAKSVAGLVASATNTV 
IENSSFTGKLIANHQDSNKNDTGGIVGNITGNSSRVNKVRVDALISTNARNNNQTAGGIVGRLENGALI 
SNSVATGEIRNGQGYSRVGGIVGSTWQNGRVNNWSNVDVGDGYVITGDQYAAADVKNASTSVDNRKAD 
RFATKLSKDQIDAKVADYGITVTLDDTGQDLKRNLREVDYTRLNKAEAERKVAYSNIEKLMPFYNKDLV 
VHYGNKVATTDKLYTTELLDWPMKDDEWTDINNKKNSINKVMLHFKDNTVEYLDVTFKENFINSQVI 
EYNVTGKEYIFTPEAFVSDYTAITNNVLSDLQNVTLN 

SP074 nucleotide (SEQ ID NO: 123) 

CTTTGGTTTTGAAGGAAGTAAGCGTGGACAATTTGCTGTAGAAGGAATCAATCAACTTCGTGAGCATGT 

AGACACTCTATTGATTATCTCAAACAACAATTTGCTTGAAATTGTTGATAAGAAAACACCGCTTTTGGA 

GGCTCTTAGCGAAGCGGATAACGTTCTTCGTCAAGGTGTTCAAGGGATTACCGATTTGATTACCAATCC 

AGGATTGATTAACCTTGACTTTGCCGATGTGAAAACGGTAATGGCAAACAAAGGGAATGCTCTTATGGG 

TATTGGTATCGGTAGTGGAGAAGAACGTGTGGTAGAAGCGGCACGTAAGGCAATCTATTCACCACTTCT 

TGAAACAACTATTGACGGTGCTGAGGATGTTATCGTCAACGTTACTGGTGGTCTTGACTTAACCTTGAT 

TGAGGCAGAAGAGGCTTCACAAATTGTGAACCAGGCAGCAGGTCAAGGAGTGAACATCTGGCTCGGTAC 

TTCAATTGATGAAAGTATGCGTGATGAAATTCGTGTAACAGTTGTTGCAACGGGTGTTCGTCAAGACCG 

CGTAGAAAAGGTTGTGGCTCCACAAGCTAGATCTGCTACTAACTACCGTGAGACAGTGAAACCAGCTCA 

TTCACATGGCTTTGATCGTCATTTTGATATGGCAGAAACAGTTGAATTGCCAAAACAAAATCCACGTCG 

TTTGGAACCAACTCAGGCATCTGCTTTTGGTGATTGGGATCTTCGCCGTGAATCGATTGTTCGTACAAC 

AGATTCAGTCGTTTCTCCAGTCGAGCGCTTTGAAGCCCCAATTTCACAAGATGAAGATGAATTGGATAC 

ACCTCCATTTTTCAAAAATCGT 

SP074 amino acid (SEQ ID NO: 124) 

FGFEGSKRGQFAVEGINQLREKVDTLLIISNNNLLEIVDKKTPLLEALSEADNVLRQGVQGITDLITNP 

GLINLDFADVKTVMANKGNALMGIGIGSGEERWEAARKAIYSPLLETTIDGAEDVIVNVTGGLDLTLI 

EAEEASQIVNQAAGQGVNIWLGTSIDESMRDEIRVTWATGVRQDRVEKWAPQARSATNYRETVKPAH 

SHGFDRHFDMAETVELPKQNPRRLEPTQASAFGDWDLRRESIVRTTDSWSPVERFEAPISQDEDELDT 

PPFFKNR 

SP075 nucleotide (SEQ ID NO: 125) 

CTACTACCTCTCGAGAGAAAGTGACCTAGAGGTGACCGTTTTTGACCATGAGCAAGGTCAAGCCACCAA 

GGCCGCAGCAGGAATTATCAGTCCTTGGTTTTCCAAACGCCGTAATAAAGCCTGGTACAAGATGGCGCG 

CTTGGGGGCTGATTTTTATGTGGATTTATTAGCTGATTTAGAGAAATCAGGACAAGAAATCGACTTTTA 

CCAGCGTTCGGGAGTCTTTCTCTTGAAAAAGGATGAATCCAATTTGGAAGAACTTTATCAACTGGCCCT 

CCAGCGCAGAGAAGAATCTCCCTTGATAGGGCAATTAGCCATTCTGAACCAAGCCTCAGCTAATGAATT 

ATTCCCTGGTTTGCAGGGATTTGACCGCCTGCTCTATGCTTCTGGTGGAGCGAGAGTAGATGGCCAACT 
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TTTAGTGACTCGTTTGCTGGAAGTCAGTCATGTCAAGCTGGTCAAAGAAAAAGTGACTCTGACACCGTT 

AGCATCAGGCTACCAGATTGGTGAAGAGGAGTTTGAGCAGGTTATTTTGGCGACGGGAGCTTGGTTGGG 

GGACATGTTAGAGCCTTTAGGTTATGAAGTGGATGTCCGTCCTCAAAAAGGACAACTACGAGATTATCA 

GCTTGCCCAAGACATGGAAGATTACCCTGTTGTCATGCCAGAAGGGGAGTGGGATTTGATTCCCTTTGC 

AGGTGGGAAATTATCCTTAGGCGCTACCCACGAAAATGACATGGGATTTGATTTGACGGTAGATGAAAC 

CTTGCTCCAACAAATGGAGGAGGCCACCTTGACTCACTATCTGATTTTGGCTGAAGCTACTTCAAAATC 

TGAGCGTGTTGGAATCCGTGCCTACACCAGTGATTTCTCTCCTTTCTTTGGGCAGGTGCCTGACTTAAC 

TGGTGTCTATGCAGCCAGTGGACTAGGTTCATCAGGCCTCACAACTGGTCCTATCATTGGTTACCATCT 

AGCCCAACTGATCCAAGACAAGGAGTTGACCTTGGACCCTCTAAATTACCCAATTGAAAACTATGTCAA 

ACGAGTAAAAAGCGAA 

SP075 amino acid (SEQ 10 NO: 126) 

YYLSRESDLEVTVFDHEQGQATKAAAGI I SPWFSKRRNKAWYKMARLGADFYVDLLADLEKSGQEIDFY 
QRSGVFLLKKDESNLEELYQLALQRREESPLIGQLAILNQASANELFPGLQGFDRLLYASGGARVDGQL 
LVTRLLEVSHVKLVKEKVTLTPLASGYQIGEEEFEQVILATGAWLGDMLEPLGYEVDVRPQKGQLRDYQ 
LAQDMEDYP WMPEGEWDLI PFAGGKLSLGATHENDMGFDLTVDETLLQQMEEATLTHYLI LAEATSKS 
ERVGIRAYTSDFSPFFGQVPDLTGVYAASGLGSSGLTTGPIIGYHLAQLIQDKELTLDPLNYPIENYVK 
RVKSE 

SP076 nucleotide (SEQ ID NO:127) 

TAAGGTCAAAAGTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTCGGTTATCTCGCCAGA 

GTATGAAATGGGGCAGTCTCTAGCACAGACCATTCTTTTCCATAATAGTGTTGATGTCTTTCAGTTGGA 

TAAAAATGTGTCTATCGTGGAGATGAAAATTCCTCAGTCTTGGGCAGGTCAAAGTCTGAGTAAATTAGA 

CCTCCGTGGCAAATACAATCTGAATATTTTGGGTTTCCGAGAGCAGGAAAATTCCCCATTGGATGTTGA 

ATTTGGACCAGATGACCTCTTGAAAGCAGATACCTATATTTTGGCAGTCATCAACAACCAGTATTTGGA 

TACCCTA 

SP07 6 amino acid (SEQ ID NO: 12 8) 

KVKSQTAKKVLEKIGADSVISPEYEMGQSLAQTILFHNSVDVFQLDKNVSIVEMKIPQSWAGQSLSKLD 

LRGKYNLNILGFREQENSPLDVEFGPDDLLKADTYILAVINNQYLDTL 

SP 077 nucleotide (SEQ ID NO: 129) 

TGACGGGTCTCAGGATCAGACTCAGGAAATCGCTGAGTGTTTAGCTAGCAAGTATCCTAATATCGTTAG 

AGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGGCTTGGTAGAGGCTTCTGGGCG 

CTATTTTAAAGTAGTTGACAGTGATGACTGGGTGGATCCTCGTGCCTACTTGAAAATTCTTGAAACTTG 

CAGGAACTTGAGAGCAAAGGTCAAGAGGTGGATGTCTTTG 

SP077 amino acid (SEQ ID NO: 130) 

DGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLVEASGRYFKWDSDDWVDPRAYLKILETC 

RNLRAKVKRWMSL 

SP078 nucleotide (SEQ ID NO: 131) 

TAGAGGCTTTGCCAAATGGTGGGAAGGGCACGAGCGTCGAAAAGAGGAACGCTTTGTCAAACAAGAAGA 
AAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACTGAAAAAGCCTTACTCGA 
TTTGCCTCCTGTTGATATGGAAACGGGTGAAATTCTGACAGAGGAAGCTGTTCAAAATCTTCCACCTAT 
TCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCTCAAGCTGAACTTAAATTCCCTGAACAGGA 
AGATGACTCAGATGACGAAGATGTTCAGGTCGATTTTTCAGCCAAAGAAGCCCTTGAATACAAACTTCC 
AAGCTTACAACTCTTTGCACCAGATAAACCAAAAGATCAGTCTAAAGAGAAGAAAATTGTCAGAGAAAA 
TATCAAAATCTTAGAAGCAACCTTTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGGCCGAAATTGG 
GCCATCAGTGACCAAGTATGAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATTTCCAATCTATC 
AGATGACCTCGCTCTAGCCTTGGCTGCCAAAGATGTCCGGATTGAAGCACCAATCCCTGGGAAATCCCT 
AATCGGAATTGAAGTGCCCAACTCCGATATTGCCACTGTATCTTTCCGAGAACTATGGGAACAATCGCA 
AACGAAAGCAGAAAATTTCTTGGAAATTCCTTTAGGGAAGGCTGTTAATGGAACCGCAAGAGCTTTTGA 
CCTTTCTAAAATGCCCCACTTGCTAGTTGCAGGTTCAACGGGTTCAGGGAAGTCAGTAGCAGTTAACGG 
CATTATTGCTAGCATTCTCATGAAGGCGAGACCAGATCAAGTTAAATTTATGATGGTCGATCCCAAGAT 
GGTTGAGTTATCTGTTTAC AATGAT ATTC C C C ACCTCTTGATTCC AGTCGTGACCAATCC ACGC AAAGC 
CAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACTCTTTGCCAAGGTGGGAGT 
TCGGAATATTGCAGGTTTTAATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAAATTCC 
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GCTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGA 

AGATGCTATCATCCGTCTTGGGCAGAAGGCGCGTGCTGCAGGTATCCACATGATTCTTGCAACTCAGCG 

TCCATCTGTTGATGTCATCTCTGGTTTGATTAAGGCCAATGTTCCATCTCGTGTAGCATTTGCGGTTTC 

ATCAGGAACAGACTCCCGTACGATTTTGGATGAAAATGGAGCAGAAAAACTTCTTGGTGGAGGAGACAT 

GCTCTTTAAACCGATTGATGAAAATCATCCAGTTCGTCTCCAAGGCTCCTTTATCTCGGATGACGATGT 

TGAGCGCATTGTGAACTTCATCAAGACTCAGGCAGATGCAGACTACGATGAGAGTTTTGATCCAGGTGA 

GGTTTCTGAAAATGAAGGAGAATTTTCGGATGGAGATGCTGGTGGTGATCCGCTTTTTGAAGAAGCTAA 

GTCTTTGGTTATCGAAACACAGAAAGCCAGTGCGTCTATGATTCAGCGTCGTTTATCAGTTGGATTTAA 

CCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTGTCATCGGTCCAGCTGAAGGTACCAAACC 

TCGAAAAGTGTTACAACAA 

SF07 8 amino acid { SEQ ID NO: 132) 

RGFAKWWEGHERRKEERFVKQEEKARQKAEKEARL EQEETEKALLDL P P VDMETGE I LTEEAVQNL P P I 
PEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKIVREN 
IKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGKSL 
IGIEVPNSDIATVSFRELWEQSQTKAENFLEIPLGKAVNGTARAFDLSKMPHLLVAGSTGSGXSVAVNG 
IIASIMKARPDQVKFMMVDPKMVELSVYNDIPHLLIPVVTNPRKASKALQKVVDEMENRYELFAKVGV 
RNIAGFNAKVEEFNSQSEYKQIPLPFIWIVDELADLMMVASKEVEDAIIRLGQKARAAGIHMILATQR 
PSVDVISGLIKANVPSRVAFAVSSGTDSRTILDENGAEKLLGRGDMLFKPIDENHPVRLQGSFI5DDDV 
ERIVNFIKTQADADYDESFDPGEVSENEGEFSDGDAGGDPLFEEAXSLVIETQKASASMIQRRLSVGFN 
RATRLMEELEIAGVIGPAEGTKPRKVLQQ 

SP079 nucleotide (SEQ ID NO: 133) 

TCAAAAAGAGAAGGAAAACTTGGTTATTGCTGGGAAAATAGGTCCAGAACCAGAAATTTTC-GCCAATAT 

GTATAAGTTGCTGATXGAAGAAAATACCAGCATGACTGCGACTGTTAAACCGAATTTTGGGAAGACAAG 

CTTCCTTTATGAAGCTCTGAAAAAAGGCGATATTGACATCTATCCTGAATTTACTGGTACGGTGACTGA 

AAGTTTGCTTCAACCATCACCCAAGGTGAGTCATGAACCAGAACAGGTTTATCAGGTGGCGCGTGATGG 

CATTGCTAAGCAGGATCATCTAGCCTATCTCAAACCCATGTCTTATCAAAACACCTATGCTGTAGCTGT 

TCCGAAAAAGATTGCTCAAGAATATGGCTTGAAGACCATTTCAGACTTGAAAAAAGTGGAAGGGCAGTT 

GAAGGCAGGTTTTACACTCGAGTTTAACGACCGTGAAGATGGAAATAAGGGCTTGCAATCAATGTATGG 

TCTCAATCTCAATGTAGCGACCATTGAGCCAGCCCTTCGCTATCAGGCTATTCAGTCAGGGGATATTCA 

AATCACGGATGCCTATTCGACTGATGCGGAATTGGAGCGTTATGATTTACAGGTCTTGGAAGATGACAA 

GCAACTCTTCCCACCTTATCAAGGGGCTCCACTCATGAAAGAAGCTCTTCTCAAGAAACACCCAGAGTT 

GGAAAGAGTTCTTAATACATTGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTACCAAGT 

CGGTGTTGAAGGCAAGTCAGCAAAGCAAGTAGCCAAGGAGTTTCTCCAAGAACAAGGTTTGTTGAAGAA 

A 


SP079 amino acid (SEQ ID NO: 134) 

QKEKENLVIAGKIGPEPEILANMYKLLIEENTSMTATVKPNFGKTSFLYEALKKGDIDIYPEFTGTVTE 

SLLQPSPKVSHEPEQVYQVARDGIAKQDHLAYLKPMSYQNTYAVAVPKKIAQEYGLKTISDLKKVEGQL 

KAGFTLEFNDREDGNKGLQSMYGLNLNVATIEPALRYQAIQSGDIQITDAYSTDAELERYDLQVLEDDK 

QLFPPYQGAPLMKEALLKKHPELERVLNTLAGKITESQMSQLNYQVGVEGKSAKQVAKEFLQEQGLLKK 

SP080 nucleotide (SEQ ID NO:135) 

ACGTTCTATTGAGGACCACTTTGATTCAAACTTCGAATTGGAATATAACCTCAAAGAAAAAGGGAAAAC 

AGATCTTTTGAAGCTAGTTGATAAAACAACTGACATGCGTCTGCATTTTATCCGCCAAACTCATCCACG 

CGGTCTCGGAGATGCTGTTTTGCAAGCCAAGGCTTTCGTCGGAAATGAACCTTTTGTCGTTATGCTTGG 

TGATGACTTGATGGATATCACAGACGAAAAGGCTGTTCCACTTACCAAACAACTCATGGATGACTACGA 

GCGTACCCACGCGTCTACTATCGCTGTCATGCCAGTCCCTCATGACGAAGTATCTGCTTACGGGGTTAT 

TGCTCCGCAAGGCGAAGGAAAAGATGGTCTTTACAGTGTTGAAACCTTTGTTGAAAAACCAGCTCCAGA 

GGACGCTCCTAGCGACCTTGCTATTATCGGACGCTACCTCCTCACGCCTGAAATTTTTGAGATTCTCGA 

AAAGCAAGCTCCAGGTGCAGGAAATGAAATTCAGCTGACAGATGCAATCGACACCCTCAATAAAACACA 

ACGTGTATTTGCTCGTGAGTTCAAAGGGGCTCGTTACGATGTCGGAGACAAGTTTGGCTTCATGAAAAC 

ATCCATCGACTACGCCCTCAAACACCCACAAGTCAAAGATGATTTGAAGAATTACCTCATCCAACTTGG 

AAAAGAATTGACTGAGAAGGAA 
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SP080 amino acid (SEQ ID NO:136) 

RSIEDHFDSNFELEYNLKEKGKTDLLKLVDKTTDMRLHFIRQTHPRGLGDAVLQAKAFVGNEPFVVMLG 

DDLMDITDEKAVPLTKQLMDDYERTHASTIAVMPVPHDEVSAYGVIAPQGEGKDGLYSVETFVEKPAPE 

DAPSDLAIIGRYLLTPEIFEILEKQAPGAGNEIQLTDAIDTLNKTQRVFAREFKGARYDVGDKFGFMKT 

SIDYALKHPQVKDDLKNYLIQLGKELTEKE 

SP081 nucleotide (SEQ ID NO; 137) 

CGCTCAAAATACCAGAGGTGTTCAGCTAATCGAGCACGTTTCTCCTCAAATGTTGAAAGCCCAATTGGA 

GAGTGTCTTTTCTGATATTCCACCTCAGGCTGTAAAAACTGGAATGTTGGCTACTACTGAAATCATGGA 

AATCATCCAACCCTATCTTAAAAAACTGGATTGTCCCTATGTCCTTGATCCTGTTATGGTTGCTACAAG 

TGGAGATGCCTTGATTGACTCAAATGCTAGAGACTATCTCAAAACAAACTTACTACCTCTAGCAACTAT 

TATTACGCCAAATCTTCCTGAAGCAGAAGAGATTGTTGGTTTTTCAATCCATGACCCCGAAGACATGCA 

GCGTGCTGGTCGCCTGATTTTAAAAGAATTTGGTCCTCAGTCTGTGGTTATCAAAGGCGGACATCTCAA 

AGGTGGTGCTAAAGATTTCCTCTTTACCAAGAATGAACAATTTGTCTGGGAAAGCCCACGAATTCAAAC 

CTGTCACACCCATGGTACT 

SP081 amino acid (SEQ ID NO: 13 8) 

AQNTRGVQLIEHVSPQMLKAQLESVFSDIPPQAVKTGMLATTEIMEIIQPYLKKLDCPYVLDPVMVATS 
GDALIDSNARDYLKTNLLPLATIITPNLPEAEEIVGFSIHDPEDMQRAGRLILKEFGPQSWIKGGHLK 
GGAKDF LFTKNEQFVWES PRI QTCHTHGT 

SP08 2 nucleotide (SEQ ID NO: 139) 

AATTGTACAATTAGAAAAAGATAGCAAATCAGACAAAGAACAAGTTGATAAACTATTTGAATCATTTGA 
TGCATCTTCAGATGAATCTATTTCTAAATTAAAAGAACTATCTGAAACTTCACTTAAAACCGATGCAGG 
TAAAGACTATCTTAATAACAAAGTCAAAGAATCATCTAAAGCAATTGTAGATTTTCATTTGCAAAAAGG 
TTTGGCTTATGATGTTAAAGATTCAGATGACAAATTTAAAGATAAAGCAACTCTTGAAACAAATGTAAA 
AGAAATTACAAAACAAATTGATTTTATCAAAAAAGTTGATGAAACTTTTAAACAAGAGAATTTGGAAGA 
AACTCTTAAATCTCTAAATGATCTTGTTGATAAATATCAAAAACAAATCGAACTTTTGAAGAAAGAAGA 
AGAAAAAGCTGCTGAAAAAGCTGCTGAAAAAGCAAAGGAATCTTCTAGTCAAAGTAATTCTTCTGGTAG 
TGCTT C TAATG AG TC TTAT AATGGATC TTC C AATTC AAATGT AGATT AT AG TTC ATC TG AAC AAACTAA 
TGGATATTCAAATAATTATGGCGGTCAAGATTATTCTGGTTCAGGAGATAGTTCAACAAATGGTGGATC 
ATCAGAACAATATTCATCTAGCAATTCAAACAGCGGAGCAAATAATGTCTACAGATATAAAGGCACTGG 
TGCTGACGGCTATCAAAGATACTACTACAAAGATCATAATAATGGAGATGTGTATGATGACGATGGAAA 
TTACCTTGGGAACTTTGGTGGCGGCATTGCAGAACCTAGTCAACGC 

SP082 amino acid (SEQ ID NO:140) 

IVQLEKDSKSDKEQVDKLFESFDASSDESISKLKELSETSLKTDAGKDYLNNKVKESSKAIVDFHLQKG 

LAYDVKDSDDKFKDKATLETNVKEITKQIDFIKKVDETFKQENLEETLKSLNDLVDKYQKQIELLKKEE 

EKAAEKAAEKAKESSSQSNSSGSASNESYNGSSNSNVDYSSSEQTNGYSNNYGGQDYSGSGDSSTNGGS 

SEQYSSSNSNSGANNVYRYKGTGADGYQRYYYKDHNNGDVYDDDGNYLGNFGGGIAEPSQR 

SP03 3 nucleotide (SEQ ID NO: 141) 

TCTGACCAAGCAAAAAGAAGCAGTCAATGACAAAGGAAAAGCAGCTGTTGTTAAGGTGGTGGAAAGCCA 

GGCAGAACTTTATAGCTTAGAAAAGAATGAAGATGCTAGCCTAAGAAAGTTACAAGCAGATGGACGCAT 

CACGGAAGAACAGGCTAAAGCTTATAAAGAATACAATGATAAAAATGGAGGAGCAAATCGTAAAGTCAA 

TGAT 

SP083 amino acid (SEQ ID NO:142) 

LTKQKEAVNDKGKAAWKWESQAELYSLEKNEDASLRKLQADGRITEEQAKAYKEYNDKNGGANRKVN 

D 

SP084 nucleotide (SEQ ID NO: 143) 

GTCCGGCTCTGTCCAGTCCACTTTTTCAGCGGTAGAGGAACAGATTTTCTTTATGGAGTTTGAAGAACT 

CTATCGGGAAACCCAAAAACGCAGTGTAGCCAGTCAGCAAAAGACTAGTCTGAACTTAGATGGGCAGAC 

GCTTAGCAATGGCAGTCAAAAGTTGCCAGTCCCTAAAGGAATTCAGGCCCCATCAGGCCAAAGTATTAC 

ATTTGACCGAGCTGGGGGCAATTCGTCCCTGGCTAAGGTTGAATTTCAGACCAGTAAAGGAGCGATTCG 

CTATCAATTATATCTAGGAAATGGAAAAATTAAACGCATTAAGGAAACAAAAAAT 
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SP084 amino acid (SEQ ID NO: 144) 

SGSVQSTFSAVEEQIFFMEFEELYRETQKRSVASQQKTSLNLDGQTLSNGSQKLPVPKGIQAPSGOSIT 

FDRAGGNS S LAKVEFQTS KGAI RYQLYLGNGKI KRI KETKN QIT 

SP085 nucleotide (SEQ ID NO: 145) 

GGGACAAATTCAAAAAAATAGGCAAGAGGAAGCAAAAATCTTGCAAAAGGAAGAAGTCTTGAGGGTAGC 

TAAGATGGCCCTGCAGACGGGGCAAAATCAGGTAAGCATCAACGGAGTTGAGATTCAGGTATTTTCTAG 

TGAAAAAGGATTGGAGGTCTACCATGGTTCAGAACAGTTGTTGGCAATCAAAGAGCCA 

SP08S amino acid (SEQ ID NO:146) 

GQIQKNRQEEAKILQKEEVLRVAKMALQTGQNQVSINGVEIQVFSSEKGLEVYHGSEQLLAIKEP 

SP086 nucleotide (SEQ ID NO: 147) 

TCGCTACCAGCAACAAAGCGAGCAAAAGGAGTGGCTCTTGTTTGTGGACCAACTTGAGGTAGAATTAGA 
CCGTTCGCAGTTCGAAAAAGTAGAAGGCAATCGCCTATACATGAAGCAAGATGGCAAGGACATCGCCAT 
CGGTAAGTCAAAGTCAGATGATTTCCGTAAAACGAATGCTCGTGGTCGAGGTTATCAGCCTATGGTTTA 
TGGACTCAAATCTGTACGGATTACAGAGGACAATCAACTGGTTCGCTTTCATTTCCAGTTCCAAAAAGC 
CTTAGAAAGGGAGTTCATCTATCGTGTGGAAAAAGAAAAAAGT '-'-AAAAAGG 

SP086 amino acid (SEQ ID NO: 148) 

RYQQQSEQKEWLLFVDQLEVELDRSQFEKVEGNRLYMKQDGKDIAIGKSKSDDFRKTNARGRGYOPMVv 

GLKSVRITEDNQLVRFHFQFQKGLEREFIYRVEKEKS 

SP087 nucleotide (SEQ ID NO: 149) 

GAACCGACAAGTCGCC CACTATC AAG ACTATGCTTTGAATAAAGAAAAATTGGTTGC TTTTGC TATGGC 

TAAACGAACCAAAGATAAGGTTGAGCAAGAAAGTGGGGAACAGTTTTTTAATCTAGGTCAGGTAAGCTA 

AAAACAAGAAAACTGGG'rT’AGTGAGGAGGG'rTGGTAGGGATAAGAGGCAA.TATGAGTTTCTGTTTCC 

CAGTCAAAATCAAAGAAGAGAAAAGAGATAAAAAGGAAGAGGTAGCGACCGATTCAAGCGAAAAAGT 

GGAGAAGAAAAAATCAGAAGAG AAGC CTGAAAAGAAAG AGAATTC A 
SP087 amino acid (SEQ ID NO:150) 

nrqvahyqdyalnkeklvafamakrtkdkveqesgeqffnlgqvsyqnkktglvtrvrtdksqyeflfo 

SVK I KEEKRDKKEEVATDS S EKVEKKK S EEKPEKKENS 

SP088 nucleotide (SEQ ID NO:151) 

GGTTGTCGGCTGGCAATATATCCCGTTTCCATCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTAT 

CAGATTAGAAGGTTTTCCAAAGTCAGAGTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGT 

TGGTTGGAAAACATTAGAGATTAAAACTAAAGACAGTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGA 

TTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTACTTTAATCAAAATCATTCTTTAGAGAC 

AGGTTGGCTTTATGATCAGTCTAACTGGTATTATCTAGCTAAGACGGAAATTAATGGAGAAAACTACCT 

ggtggtgaaagacgtgcggggtggataaacgatgattcgacttggtactacctagatccaacaactgg 

TATTATGCAAACAGGTTGGCAATATCTAGGTAATAAGTGGTACTACCTCCGTTCCTCAGGAGCAATGGC 

ACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTTAGACCACCCAAATGGCGATATGAAAACAGG 

TTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCA 

GATGGTTCAACTTGGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAA 

TGGCAACTGGTACTATGCTTATAGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGT 

caactataatggcgaatgggttcgg 

SP088 amino acid (SEQ ID NO: 152) 

WGWQYIPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKRED 

SEDKEEKRYYTNYYFNQNHSLETGWLYDQSNWYYLAKTEINGENYLGGERRAGWINDDSTWYYLDPTTG 

^^ G ^ QY ^^ t ^ LRSSGAMATGWYQEG ' rnmrLDHPN GDMKTGWQNLGNKWYYLRSSGAMATGWYO 

DGSTWYYLNAGNGDMKTGWFQVNGNWYYAYSSGALAVNTTVDGYSVNYNGEWVR 

SP089 nucleotide (SEQ ID NO: 153) 

GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAG 

AAATGCTTGGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGA 

TTCTCAATACGATGCTTGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAAT 
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TAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTA 

TGTGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTA 

CATCAAAGAAAATGGAAACTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAA 

ATCCGGTGGCTACATGGCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGA 

TGGGAAAATGGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGG 

TGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATCTGATGGGAA 

AATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTA 

CATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGC 

TGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGC 

GAAAAATGAGACAGTAGATGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAA 

TGAAAATGCTGCTTACTATCAAGTAGTGCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCT 

TTCCTATATATCGCAAGGTAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGCGCTTGGC 

TATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGATTTACAAGCGCTAGATGCTAGTAAGGA 

CTTTATCCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGCTCAGAATGCTAGTATCCC 

AGTAGCTTCTCATCTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATTTTGA 

TGGTTTTAAGCTTGAGAATCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAAGA 

ATTGGATAAGGTATTTAGTTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAA 

GGAAGCCGAAGAACATTACCATATCAATGCTCTTTATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTG 

GGGAAGAAGTAAAATTGCCAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTA 

CCTTTCTGCTAAGACATTTGATGATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAA 

TTATATCGATAGGGGAAGAACTTTCCTTGGAAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGA 

CCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAG 


SP089 amino acid (SBQ ID 110:154) 

aksewvedkgafyyldqdgkmkrnawvgtsyvgatgakviedwvydsqydawfyikadgqhaekewlqi 

KGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLK ' 
SGGYMAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGK 

iaekewvydshsqawyyfksggymtanewiwdkeswfylksdgkiaekewvydshsqawyyfksggyma 

KNETVDGYQLGSDGKWLGGKTTNENAAYYQWPVTANVYDSDGEKLSYISQGSWWLDKDRKSDDKRLA 
ITI SGLSGYMKTEDLQALDASKDFI PYYESDGHRFYHYVAQNAS I PVASHLSDMEVGKKYYSADGLHFD 
GFKLENPFLFKDLTEATNYSAEELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSALESNW 

GRSKI AKDKNNFFGIT AYDTTPYLS AKTFDDVDKG I LGATKW I KENY I DRGRTFLGNKASGMNVEYASD 
PYWGEKIASVMMKINEK 

SP090 nucleotide (SBQ ID NO: 155) 

ATTTGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAATGGTAGAACCTACTACAAAAAGGGGGA 

TCTAAAAGAAACCTACTGGAGAGTGATAGATGGGAAGTACTATTATTTTGATCCTTTATCCGGAGAGAT 

GGTTGTCGGCTGGCAATATATACCTGCTCCACACAAGGGGGTTACGATTGGTCCTTCTCCAAGAATAGA 

GATTGCTCTTAGACCAGATTGGTTTTATTTTGGTCAAGATGGTGTATTACAAGAATTTGTTGGCAAGCA 

agttttagaagcaaaaactgctacgaataccaacaaacatcatggggaagaatatgatagccaagcaga 

GAAACGAGTCTATTATTTTGAAGATCAGCGTAGTTATCATACTTTAAAAACTGGTTGGATTTATGAAGA 
GGGTCATTGGTATTATTTACAGAAGGATGGTGGCTTTGATTCGCGCATCAACAGATTGACGGTTGGAGA 
' GCTAGCACGTGGTTGGGTTAAGGATTACCCTCTTACGTATGATGAAGAGAAGCTAAAAGCAGCTCCATG 
GTACTATCTAAATCCAGCAACTGGCATTATGCAAACAGGTTGGCAATATCTAGGTAATAGATGGTACTA 

CCTCC ATTCGTC AGGAGCT ATGGC AACTGGCTGGT AT AAGGAAGGCTC AAC TTGGT ACT ATCT AGATGC 

TGAAAATGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGG 

AGC TATGGCAACTGGTTGGTATC AGGAAAGTTCG ACTTGGT ACT ATCTAAATGC AAGTAATGG A 

GAAAACAGGCTGGTTCCAAGTCAATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCTGTTAA 

TACCACAGTAGGTGGTTACTACTTAAACTATAATGGTGAATGGGTTAAG 

SPO 9 0 amino acid (SEQ ID NO: 156) 

VFADDSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDPLSGEMWGWQYIPAPHKGVTIGPSPRI 

EIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEDQRSYHTLKTGWIYE 

EGHWYYLQKDGGFDSRINRLTVGELARGWVKDYPLTYDEEKLKAAPWYYLNPATGIMQTGWQYLGNRWY 

YLHSSGAMATGWYKEGSTWYYLDAENGDMRTGWQNLGNKWYYLRSSGAMATGWYQESSTWYYLNASNGD 

MKTGWFQ VNGNWYY A YD S G ALA VNTTV GG YY LNYNG EWVK 
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SP091 nucleotide { SEQ ID NO: 157) 

TGTCGCTGCAAATGAAACTGAAGTAGCAAAAACTTCGCAGGATACAACGACAGCTTCAAGTAGTTCAGA 

GCAAAATCAGTCTTCTAATAAAACGCAAACGAGCGCAGAAGTACAGACTAATGCTGCTGCCCACTGGGA 

TGGGGATTATTATGTAAAGGATGATGGTTCTAAAGCTCAAAGTGAATGGATTTTTGACAACTACTATAA 

GGCTTGGTTTTATATTAATTCAGATGGTCGTTACTCGCAGAATGAATGGCATGGAAATTACTACCTGAA 

ATCAGGTGGATATATGGCCCAAAACGAGTGGATCTATGACAGTAATTACAAGAGTTGGTTTTATCTCAA 

GTCAGATGGGGCTTATGCTCATCAAGAATGGCAATTGATTGGAAATAAGTGGTACTACTTCAAGAAGTG 

GGGTTACATGGCTAAAAGCCAATGGCAAGGAAGTTATTTCTTGAATGGTCAAGGAGCTATGATGCAAAA 

TGAATGGCTSCTATGATCCAGCCTATTCTGCTTATTTTTATCTAAAATCCGATGGAACTTATGCTAACC 

AAGAGTGGCAAAAAGTGGGCGGCAAATGGTACTATTTCAAGAAGTGGGGCTATATGGCTCGGAATGAGT 

GGCAAGGCAACTACTATTTGACTGGAAGTGGTGCCATGGCGACTGACGAAGTGATTATGGATGGTACTC 

GCTATATCTTTGCGGCCTCTGGTGAGCTCAAAGAAAAAAAAGATTTGAATGTCGGCTGGGTTCACAGAG 

ATGGTAAGCGCTATTTCTTTAATAATAGAGAAGAACAAGTGGGAACCGAACATGCTAAGAAAGTCATTG 

ATATTAGTGAGCACAATGGTCGTATCAATGATTGGAAAAAGGTTATTGATGAGAACGAAGTGGATGGTG 

TCATTGTTCGTCTAGGTTATAGCGGTAAAGAAGACAAGGAATTGGCGCATAACATTAAGGAGTTAAACC 

GTCTGGGAATTCCTTATGQTGTCTATCTCTATACCTATGCTGAAAATGAGACCGATGCTGAGAGTGACG 

CTAAACAGACCATTGAACTTATAAAGAAATACAATATGAACCTGTCTTACCCTATCTATTATGATGTTG 

AGAATTGGGAATATGTAAATAAGAGCAAGAGAGCTCCAAGTGATACAGGCACTTGGGTTAAAATCATCA 

ACAAGTACATGGACACGATGAAGCAGGCGGGTTATCAAAATGTGTATGTCTATAGCTATCGTAGTTTAT. 

TACAGACGCGTTTAAAACACCCAGATATTTTAAAACATGTAAACTGGGTAGCGGCCTATACGAATGCTT 

TAGAATGGGAAAACCCTCATTATTCAGGAAAAAAAGGTTGGCAATATACCTCTTCTGAATACATGAAAG 

GAATCCAAGGGCGCGTAGATGTCAGCGTTTGGTAT 

SP091 amino acid (SEQ ID NO:158) 

VAANETEVAKTSQDT^TASSSSEQNQSSNKTQTSAEVQTNAAAHWDGDYYVKDDGSKAQSEWIFDNYYK 
AWFYINSDGRYSQNEWHGNYYLKSGGYMAQNEWIYDSNYKSWFYLKSDGAYAHQEWQLIGNKWYYFKKW 
GYMAKSQWQGSYFLNGQGAMMQNEWLYDPAYSAYFYLKSDGTYANQEWQKVGGKWYYFKKWGYMARNEW 
QGNYYLTGSGAMATDEVIMDGTRYIFAASGELKEKKDLNVGWVHRDGKRYFFNNREEQVGTEHAKKVTD 
ISEHNGRINDWKKVIDENEVDGVIVRLGYSGKEDKELAHNIKELNRLGIPYGVYLYTYAENETDAESDA 
KQTIELIKKYNMNLSYPI YYDVENWEYVNKSKRAPSDTGTWVKI INKYMDTMKQAGYQNVYVYSYRSLL 
QTRL KH PD I L KHVNWVAA YTNAL EWENP H YSGKKGWQ YT S S E YMKG I QGRVDVS VWY 

SP092 nucleotide (SEQ ID NO: 159) 

TACGTCTCAGCCTACTTTTGTAAGAGCAGAAGAATCTCCACAAGTTGTCGAAAAATCTTCATTAGAGAA 

GAAATATGAGGAAGCAAAAGCAAAAGCTGATACTGCCAAGAAAGATTACGAAACGGCTAAAAAGAAAGC 

AGAAGACGCTCAGAAAAAGTATGAAGATGATCAGAAGAGAACTGAGGAGAAAGCTCGAAAAGAAGCAGA 

AGCATCTCAAAAATTGAATGATGTGGCGCTTGTTGTTCAAAATGCATATAAAGAGTACCGAGAAGTTCA 

AAATCAACGTAGTAAATATAAATCTGACGCTGAATATCAGAAAAAATTAACAGAGGTCGACTCTAAAAT 

AGAGAAGGCTAGGAAAGAGCAACAGGACTTGCAAAATAAATTTAATGAAGTAAGAGCAGTTGTAGTTCC 

TGAACCAAATGCGTTGGCTGAGACTAAGAAAAAAGCAGAAGAAGCTAAAGCAGAAGAAAAAGTAGCTAA 

GAGAAAATATGATTATGCAACTCTAAAGGTAGCACTAGCGAAGAAAGAAGTAGAGGCTAAGGAACTTGA 

AATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGAAGTTGCTACTGCTCAACATCAAGTAGA 

TAATTTGAAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTTATAGAAGCTAAATTAAA 

AAAAGGAGAAGCTGAGCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACT 

TCTTGACAGCCTTGATCCTGAAGGTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTT 

GGATAAAAAAGCTGATGAACTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAAT 

ATTACTTGGAGGGGCTGATNCTGAAGATGATACTGCTGCTCTTCAAAATAAATTAGCTACTAAAAAAGC 

TGAATTGGAAAAAACTCAAAAAGAATTAGATGCAGCTCTTAATGAGTTAGGCCCTGATGGAGATGAAGA 

AGAAACTCCAGCGCCGGCTCCTCAACCAGAGCAACCAGCTCCTGCACCAAAACCAGAGCAACCAGCTCC 

AGCTCCAAAACCAGAGCAACCAGCTCCTGCACCAAAACCAGAGCAACCAGCTCCAGCTCCAAAACCAGA 

GCAACCAGCTCCAGCTCCAAAACCAGAGCAACCAGCTAAGCCGGAGAAACCAGCTGAAGAGCCTACTCA 

ACCAGAAAAACCAGCCACTCCAAAAACAGGCTGGAAACAAGAAAACGGTATGTGGTATTTCTACAATAC 

TGATGGTTCAATGGCAATAGGTTGGCTCCAAAACAACGGTTCATGGTACTACCTAAACGCTAACGGCGC 

TATGGCAACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGC 

AAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATGTCAACAGCAATGGCGCTATGGCGACAGGCTG 

GCTCCAATACAATGGCTCATGGTACTACCTCAACGCTAATGGTGATATGGCGACAGGATGGCTCCAATA 

CAACGGTTCATGGTATTACCTCAACGCTAATGGTGATATGGCGACAGGATGGGCTAAAGTCAACGGTTC 

ATGGTACTACCTAAACGCTAACGGTGCTATGGCTACAGGTTGGGCTAAAGTCAACGGTTCATGGTACTA 
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CCTAAACGCTAACGGTTCAATGGCAACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGC 

ATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATGTCAATGGCTTAGG 

TGCCCTTGCAGTCAACACAACTGTAGATGGCTATAAAGTCAATGCCAATGGTGAATGGGTT 

SP092 amino acid (SEQ ID NO:160) 

TSQPTFVRAEESPQWEKSSLEKKYEEAKAKADTAKKDYETAKKKAEDAQKKYEDDQKRTEEKARKEAE 

ASQKLNDVALWQNAYKEYREVQNQRSKYKSDAEYQKKLTEVDSKIEKARKEQQDLQNKFNEVRAVWP 

EPNALAETKKKAEEAKAEEKVAKRKYDYATLKVALAKKEVEAKELEIEKLQYEISTLEQEVATAQHQVD 

NLKKLLAGADPDDGTEVIEAKLKKGEAELNAKQAELAKKQTELEKLLDSLDPEGKTQDELDKEAEEAEL 

DKKADELQNKVADLEKEISNLEILLGGADXEDDTAALQNKLATKKAELEKTQKELDAALNELGPDGDEE 

ETPAPAPQPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQ 

P EKP AT P KTGWKQ ENGMWY F YNTDG SMA I GWLQNNG S WY YLNANG AMATGWVKDGDTWYYL EASG AMKA 

SQWFKVSDKWYYVNSNGAMATGWLQYNGSWYYLNANGDMATGWLQYNGSWYYLNANGDMATGWAKVNGS 

WYYLNANG7VMATGWAKVNGSWYYLNANGSMATGWVKDGDTWYYLEASGAMKASQWFKVSDKWYYVNGLG 

ALAVNTTVDG YKVNANG EWV 

P093 nucleotide (SEQ ID NO: 161) 

TGGACAGGTGAAAGGTCATGCTACATTTGTGAAATCCATGACAACTGAAATGTACCAAGAACAACAGAA 
CCATTCTCTCGCCTACAATCAACGCTTGGNTTCGCAAAATCGCATTGTAGATCCTTTTTTGGCGGAGGG 
ATATGAGGTCAATTACCAAGTGTCTGACGACCCTGATGCAGTCTATGGTTACTTGTCTATTCCAAGTTT 
GGAAATCATGGAGCCGGTTTATTTGGGAGCAGATTATCATCATTTAGGGATGGGCTTGGCTCATGTGGA 
TGGTACACCGCTGCCTCTGGATGGTACAGGGATTCGCTCAGTGATTGCTGGGCACCGTGCAGAGCCAAG 
CCATGTCTTTTTCCGCCATTTGGATCAGCTAAAAGTTGGAGATGCTCTTTATTATGATAATGGCCAGGA 
AATTGTAGAATATCAGATGATGGACACAGAGATTATTTTACCGTCGGAATGGGAAAAATTAGAATCGGT 
TAGCTCTAAAAATATCATGACCTTGATAACCTGCGATCCGATTCCTACCTTTAATAAACGCTTATTAGT 
GAATTTTGAACGAGTCGCTGTTTATCAAAAATCAGATCCACAAACAGCTGCAGTTGCGAGGGTTGCTTT 
TACGAAAGAAGGACAATCTGTATCGCGTGTTGCAACCTCTCAATGGTTG * 

SP09 3 amino acid ( SEQ ID NO : 16 2 ) 

GQVKGHATFVKSMTTEHYQEQQNHSLAYNQRLXSQNRIVDPFLAEGYEVNYQVSDDPDAVYGYLSIPSL 

EIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSHVFFRHLDQLKVGDALYYDNGQE 

IVEYQMMDTEXILPSEWEKLESVSSKNIMTLITCDPlPTFNKRLLVNFERVAVYQKSDPQTAAVARVAr 

TKEGQSVSRVATSQWL 

SP094 nucleotide (SEQ ID NO: 163) 

GATTGCTCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTGC 

CAAGGAAGTTGGTGCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGA 

TCAGTTGATGGTAGCTATTCGTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTTCGAG 

CCAGATTAATCCACATTTCCTCTATAACACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAG 

TCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCAACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGA 

CTTGATTTGTCTCTCTGACGAAATCAATCATGTCCGCCAGTATCTCTTTATCCAGAAACAACGCTATGG 

agataagctggaatacgaaattaatgaaaatgttgcctttgataatttagtcttacccaagctggtcct 

ACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAGGTCAGGGCCATATTAAACT 

TTC TGTC C AG AAAC AGG ATTCGGG ATTGGTC AT C CGT ATTG AGG ATG ATGGCGTTGGC TTC C AAG ATG C 

TGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTCAA 

ACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTGAAAT 

ATATATAAATAGAATAGAAACTAGC 

SP094 amino acid (SEQ ID NO: 164) 

IAPLKDLRETMLEIASGAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSS 
QINPHFLYNTLDTIIWMAEFHDSQRWQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLr IQKQRYG 

DKLEYEINENVAFDNLVXjPKLVLQPLVENALYHGIKEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDA 

GDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKVEIYINRIETS 

SP095 nucleotide (SEQ ID NO: 165) 

TAGGTCATATGGGACTTTTTTTCTACAACAAAATAGGCTCCATAATATCTATAAGGGATTTACCCACTA 

CAAATATTATAGAGCCGAAAATTCACATCTAATATATGCAGACTACTTTGAAATGAAATTAAAAAAATT 

ATTAAAGGATGACACAAAAGTTTTTGAAAAATCTACATTCAAATTTGTAGAAGGATATAAAATATACCT 
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GACAGAATCTAAAGAATCTGGAATTAAACAAATGGACAATGTCATAAAATATTTTGAGTTTATTGAATC 

TAAAAGTATTGCTTTATATTTTCAAAAACGATTAAATGAGCTGATAGAT 

SP095 amino acid (SEQ ID NO: 166) 

RSYGTFFLQQNRLHNIYKGFTHYKYYRAENSHLIYADYFEMKLKKLLKDDTKVFEKSTFKFVEGYKIYL 

TESKESGIKQMDNVIKYFEFIESKSIALYFQKRLNELID 

SP096 nucleotide (SEQ ID NO: 167) 

CAACGTTGAGAATTATTTGCGAATGTGTTTGGATAGCATTCAGAATCAGACGTATCAAAATTTTGAGTG 

TTTATTAATCAATGATGGCTCTCCAGATCATTCATCCAAAATATGTGAAGAATTTGTAGAGAAAGATTC 

TCGTTTCAAATATTTTGAGAAAGCAAACGGCGGTCTTTCATCAGCTCGTAACCTAGGTATTGAATGTTC 

GGGGGGGGGCGTACATTACTTTTGTAGACTC 

SPQ9 6 amino acid (SEQ ID NO: 168) 

NVENYLRMCLDS IQNQTYQNFEC LLINDGSPDHSSKICEEFVEKDSRFKYFEKANGGLS 3 ARNLG I ECS 
GGGVHYFCRL 

SP097 nucleotide (SEQ ID NO: 169) 

CTACTATCAATCAAGTTCTTCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCA 

GACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACTGGTTTGACCCAGCAGAC 

GGATGTTCTGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGAC 

CATCTTGAAGTCAGATAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTAC 

AGATGACAGTGTGCAGATGAAAACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCA 

TCAGGGAGCTATGCCTGTTTTGACTCCAGCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCA 

AGAACTTGTTGATGCAAAGGGAGCCAATCTTGGTGTGCTTCGTTTGGATATTTCTTATGAAACTCTGGA 

AGCCTATCTCAATCAACTCCAGTTGGGGCAGCAGGGCTTTGCCTTCATTATCAATGAAAACCATGAATT 

TGTCTACCATCCTCAACACACAGTTTATAGTTCGTCTAGCAAAATGGAGGCTATGAAACCCTACATCGA 

TACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATTGCAGGAACTGATTG 

GACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAG 

SP097 amino acid (SEQ ID NO: 17 0) 

YYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENPSQDKVEGIRDLFLT 

ILKSDKDLKTWLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQWVISVTQ 

ELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENKEFVYHPQHTVYSSSSKMEAMKPYID 

TGQGYTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVRSQ 

SP09B nucleotide (SEQ ID NO: 171) 

GACAAAAACATTAAAACGTCCTGAGGTTTTATCACCTGCAGGGACTTTAGAGAAGCTAAAGGTAGCTGT 

TCAGTATGGAGCAGATGCTGTCTTTATCGGTGGTCAGGCCTATGGTCTTCGTAGCCGTGCGGGAAACTT 

TACTTTCGAACAGATGGAAGAAGGCGTGCAGTTTGCGGCCAAGTATGGTGCCAAGGTCTATGTAGCGGC 

TAATATGGTTATGCACGAAGGAAATGAAGCTGGTGCTGGTGAGTGGTTCCGTAAACTGCGTGATATCGG 

GATTGCAGCAGTTATCGTATCTGACCCAGCCTTGATTATGATTGCAGTGACTGAAGCACCAGGCCTTGA 

AATCCACCTTTCTACCCAAGCCAGTGCCACTAACTATGAAACCCTTGAGTTCTGGAAAGAGCTAGGCTT 

GACTCGTGTCGTTTTAGCGCGTGAGGTTTCAATGGAAGAATTAGCTGAGATCCGCAAACGTACAGATGT 

TGAAATTGAAGCCTTTGTCCATGGAGCTATGTGTATTTCATACTCTGGACGTTGTACTCTTTCAAACCA 

CATGAGTATGCGTGATGCCAACCGTGGTGGATGTTCTCAGTCATGCCGTTGGAAATACGACCTTTACGA 

TATGCCATTTGGGAAAGAACGTAAGAGTTTGCAGGGTGAGATTCCAGAAGAATTTTCAATGTCAGCCGT 

TGACATGTCTATGATTGACCANATTCCAGATATGATTGAAAATGGTGTGGACAGTCTAAAAATCGAAGG 

ACGTATGNAGTCTATTCACTANGTATCAACAGTAACCAACTGCTACAAGGCGGCTGTGGATGCCTATCT 

TGAAAGTCCTGAAAAGTTTGAAGCTATCAAACAAGACTTGGTGGACGAGATGTGGAAGGTTGCCCAACG 

TGAACTGGCTACAGGATTTTACTATGGTACACCATCTGAAAATGAGCAGTTGTTTGGTGCTCGTCGTAA 

AATCCCTGAGTACAAGTTTGTCGCTGAAGTGGTTTCTTATGATGATGCGGCACAAACAGCAACTATTCG 

TCAACGAAACGTCATTAACGAAGGGGACCAAGTTGAGTTTTATGGTCCAGGTTTCCGTCATTTTGAAAC 

CTATATTGAAGATTTGCATGATGCTAAAGGCAATAAAATCGACCGCGCTCCAAATCCAATGGAACTATT 

GACTATTAAAGTCCCACAACCTGTTCAATCAGGAGACATGGTTCGAGCTCTTAAAGAGGGGCTTATCAA 

TCTTTATAAGGAAGATGGAACCAGCGTCACAGTTCGTGCT 
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SP098 amino acid (SEQ ID NO:172) 

TKTLKRPEVLSPAGTLEKLKVAVQYGADAVFIGGQAYGLRSRAGNFTFEQMEEGVQFAAKYGAKVYVAA 

NMVMHEGNEAGAGEWFRKLRDIGIAAVIVSDPALIMIAVTEAPGEEIHLSTQASATNYSTLEFWKELGL 

TRWLAREVSMEELAEIRKRTDVEIEAFVHGAMCISYSGRCTLSNHMSMRDANRGGCSQSCRWKYDLYD 

MPFGKERKSLQGEIPEEFSMSAVDMSMIDXIPDMIENGVDSLKIEGRMXSIHXVSTVTNCYKAAVDAYL 

ESPEKFEAIKQDLVDEMWKVAQRELATGFYYGTPSENEQLFGARRKIPEYKFVAEWSYDDAAQTATIR 

QRNVINEGDQVEFYGPGFRHFETYIEDLHDAKGNKIDRAPNPMELLTIKVPQPVQSGDMVRALKEGLIN 

LYKEDGTSVTVRA 

SP099 nucleotide (SEQ ID NO: 173) 

TTCTCAGGAGACCTTTAAAAATATCACCAATAGCTTCTCCATGCAAATCAATCGTCGCGTCAACCAAGG 
AACGCCTCGTGGTGCTGGGAATATCAAGGGTGAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGA 
GTCTTATGTCAAACGTATCAACGCTATCGGAGATTTGACTGGATATGACCTGATTGAAACGCCAGAAAC 
CAAGAAGAATCTCACTGCTGATCGTGCCAAGCGTTTTGGAAGTAGCTTGATGATTACAGGTGTCAATGA 
CTCCTCTAAAGAAGACAAGTTTGTCTCTGGTTCTTATAAACTAGTCGAAGGAGAGCACTTAACCAACGA 
C G AC AAGGAT AAAATC C TC TTGC AC AAGG ACTTGGC AGC C AAA C A.CG GC TGGAAAGT AGGGG A C AAGGT 
TAAACTGGACTCTAATATCTACGATGCAGATAATGAAAAAGGAGCCAAGGAAACAGTTGAAGTGACAAT 
CAAGGGACTCTTTGATGGTCATAATAAGTCAGCAGTAACCTACTCACAAGAACTTTACGAAAACACAGC 
TATTACAGACATTCACACTGCTGCAAAACTTTATGGATACACAGAAGACACAGCCATTTATGGGGACGC 
AACCTTCTTTGTAACAGCAGACAAGAACTTGGATGATGTTATGAAAGAGTTGAATGGCATCAGTGGTAT 
CAACTGGAAGAGCTACACACTCGTCAAGAGCTCCTCTAACTACCCAGCTCTTGAGCAATCTATCTCTGG 
TATGTACAAGATGGCCAAC 

SP099 amino acid (SEQ ID NO: 174) 

SQETFKNITNSFSMQJNRRVNQGTPRGAGNIKGEDIKKITENKAIESYVKRINAIGDLTGYDLIETPET 

KKNLTADRAKRFGSSLMITGVNDSSKEDKFVSGSYKLVEGEHLT>JDDKDKILLHKDLAAKHGWKVGDKV 

KLDSNIYDADNEKGAKETVEVTIKGLFDGHNKSAVTYSQELYENTAITDIHTAAKLYGYTEDTAIYGDA 

TFFVTADKNLDDVMKELNGISGINWKSYTLVKSSSNYPALEQSISGMYKMAN 

SP100 nucleotide (SEQ ID NO: 175) 

AGTAAATGCGCAATCAAATTCATTAATATTAATAGATGAACCTGAAATCTCACTTCATCCGAGTGCAAT 

CTATAAATTTAAAGAGTTTTTACTTCAAGAGTGTTTAAATAAAAAACATCAAATTATTATCACTACACA 

TTCTACACAACTTATAAAAGATTTTCCTAGAGAAGCCGTGAAACTTTTAGTGAAAAACGGAGAAAAGGT 

AGATGTTATTGAAAATATTGATTATCAGGATGCATTTTTTGAATTAGGTGATGTGTATCATTCTAGGAA 

GATGATTTATGTTGAAGATAGACTAGCTAAATATATTCTAGAGTTTGTTATCACTCATTCAGGTAGTGA 

GAATCTTAAACAGAATTTAGTAGTGAGATATATTCCTGGTGGAGCAAATCAAATAATTTGTAATAATAT 

TTTAAACTCATCGTATTTAGATTCCGATAACCATTATTTTTGGCTTGATGGAGATCAAAACACTAATGT 

TAGTGAATCAAATAATTTAATGAACTATCTTGAAAATGGTGTTGTTATATCAGATAAAATTCCTGAATC 

AGATAATAAAAATCTTGATGATATTATAAAATTGATAANGGGATGTCCAATTAAATTTAATGTTTCAGG 

TAATAAAGGGCAAAAAAATAATATTGAATTAATTGCGAAACAAAGAAGCTTTATAGATTATTGGGCTAA 

ATAC 

SP100 amino acid (SEQ ID NO: 176) 

VNAQSNSLILIDEPEISLHPSAIYXFKEFLLQECLNKKHQIIITTHSTQLIKDFPREAVKLLVKNGEKV 
DVIENIDYQDAFFELGDVYHSRKMIYVEDRLAKYILEFVITHSGSENLKQNLWRYIPGGANQIICNNI 
LNSSYLDSDNHYFV/LDGDQNTNVSESNNLMNYLENGWISDKIPESDNKNLDDIIKLIXGCPIKFNVSG 
NKGQKNNI EL I AKQRS F I D YWAKY 

SP101 nucleotide (SEQ ID NO:177) 

TTACCGCGTTCATCAAGATGTCAAACAAGTCATGACCTATCAACCCATGGTGCGAGAAATATTGAGTGA 

ACAAGACACCCCAGCAAACGAAGAGCTTGTGCTTGCTATGATTTATACTGAAACAAAAGGAAAAGAAGG 

CGATGTTATGCAGTCTAGTGAGTCTGCAAGTGGTTCCACCAACACCATCAATGATAATGCCTCTAGCAT 

TCGGCAAGGCATTCAAACTCTGACAGGCAATCTCTATCTGGCGCAGAAGAAGGGGGTAGATATCTGGAC 

AGCTGTTCAAGCCTATAATTTTGGACCTGCCTATATCGATTTTATCGCCCAAAATGGCAAGGAAAATAC 

CCTGGCTCTAGCCAAACAGTACTCTCGTGAGACTGTTGCCCCCTTGCTTGGTAATAGGACTGGAAAGAC 

TTATAGTTATATTCACCCCATTTCCATTTTTCACGGTGCTGAACTCTATGTAAATGGAGGAAACTATTA 

TTATTCTAGACAGGTACGACTTAACCTTTACATCATCAAATGTTTCACTCTCTTTTCAACATCTGGC 
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SP101 amino acid (SEQ ID NO: 178) 

^™^^ tyqpmvreilseqdtp ^ eelvl amiytetkgkegdvmqssesasgstntindnassi 

rqgiqtltgnl/laqkkgvdiwtavqaynfgpayidfiaqngkentlalakqysretvapllgnrtgkt 

YSYIHP I S I FHGAEL YVNGGNYY’ { SRQVRLNLY 1 1 KCFTLFSTSG ^bKETVAPLLGNRTGKT 

SP102 nucleotide (SEQ ID NO: 179 ) 

^^ GGGCTTTAACTATCTTCGTATTCGCCGTGCGGCTAAAATTGTGGA caatgaggagtttgaagc 

“ GTACGGGTCAATTGA ^ GA TTTGCGCGACCCAGCAGAATTCCACAGAAAACATATCCTTGG 

t „ G ^ GGCAATATTCCTTCAAGTCAGTTGAAAACTA G t CTTGCAGCCCTTCGTAAAGATAAACCTGTCCT 

TCTACGAAAACCAACGTGCGCAACGAGTTACAAATGCAGCTCTTTACTTGAAAAAACAAGGT’^TTC 

tgagatttatatcctttcttatggcttggattgttggaaagggaaagtgaagactagc 

SP102 amino acid (SEQ ID NO: 180) 

WMGFNYLRIRRAAKIVDNEEFEALIRTGQLIDLRDPAEFHRKHILGARNIPSSOLKTSLAALRKDKPvr 
LYENQRAQRVTNAALYLKKQGFSEIYILSYGLDSWKGKVKTS sqlktslaalrkdkpvl 

SP103 nucleotide (SEQ ID NO:181) 

AG ^^ ccagcatcgttcgcaggaaaataaggacaataa tcgtgtctcttatgtggatggcagccagtc 
^^ GAGAAAAG I gaaaacttgacaccagaccaggttagccagaa agaa gg a at tcaggctgagc a St 
ar .I^ G ^ ATTACAGATCAGGGCTATGTAACGTCACA CGGTGACCACTATCATTACTATAATGGGAA 
^ CTTATGATGCCCTGTTTAGTGAAGAACTCTTGATGAAGGATCCAAACTATCAACT'T'AAAGACGG 
TGATATTGTCAATGAAGTCAAGGGTGGTTATATCATCAAGGTCGATGGAAAATATTATGTGTACCTGAA 

if'a^ GCAGCTCATGCTGATAATCTTCGAACTAAAGATGAAATCAATCGTCAAAAACAAGAAG ATGTCAA 

AGATAATGAGAAGGTTAACTCTAATGTTGCTGTAGCAAGGTCTCAGGGACGATATACGAC AAATGATGG 

ttatgtgtttaatccagctgatattatggaagataggggtaatgcttatatggttcctca^IggSa 

ATCACTACATTCCCAAAAGCGATTTATCTGCTAGTGAATTAGCAGCAGCTAAAGCACATCTGGCTGG 

aaaaaatatg ca a ccgag t c agtt.a agct a ttc t tc aa c a g ct a gt ga caataacacgcaatctgtagc 

f^^ GGA ^ aactagcaagccagcaaataaatctgaaaa tctccagagtgttttgaaggaactctatga 

IT GACCTAGCGCCCAACGTTACAGTGAATCAGA,rGGCCTGGTG TTTGACCCTGCTAAGATTATCAGTCG 

CT?ArA^^S GTTGCGATTCCGCATGGCGACCATTACCACTTTATTCCTTACAGCAAGCT TTCTGC 

CTTAGAAGAAAAGATTGCCAGAATGGTGCCTATCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAA 

ACCTAATGAAGTAGTGTCTAGTCTAGGCAGTCTTTCAAGCAATCCTTCTTCTTTAACGACAAGTAAGGA 

gctgtgttcaggatctgatggttatatttgtaatggaaaagatatggttgaagI^cgSc^S^tS 

A^r?I^^- TGGTGATCATTTCCATTACATTCCAAAATCAAATCAAATTGGGCAACCGACTG TTCC 

AAACAATAGTC . AGCAACACCTTCTCCATCTCTTCCAATCAATCCAGGAACTTCACATGAGAAACATGA 

rr-ra^a TGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAATCAGGTTTTGTCA TGAGTCA 

CGGAGACCACAATCATTATTTCTTCAAGAAG 

SP103 amino acid (SEQ ID NO: 182) 

VP^Af2^^ RVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDGGYVTSHG DHYHYYNGK 

nM^^^ SEELLMKDPWQLKDADIWE ^ GGYIIKVDGKYYVYLKDAAHADNVRTKD EINRQKQEHVK 

^™^ NSNVAVARSQGRYTTNDG ' /vfnpadiiedtgnayivphgghyhyipksdlsas elaaakahlag 

k^qpsqlsysstasdnntqsvakgstskpanksenlqsllkelydspsaqrysesdglvfdpakiisr 

^™ G Y^ phgdkyhfipysklsalee ki a ^pis g tg S tvstn a kpnewsslgslssnpssSske 

asdgyifnpk °iveetatayivrhgdhfhyipksnqigqptlpnnslatpspslpinpgtshekhe 

edgygfdanri I aedesgfvmshgdhnhyffkk gtshekhe 
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TGGTATCCAAGAAGGTGCTGCTAACTCAATCCTTATCAAAGTTAACCAAATCGGTACTCTTACTGAAAC 

TTTTGAAGCTATCGAAATGGCTAAAGAAGCTGGTTACACTGCTGTTGTATCACACCGTTCAGGTGAAAC 

TGAAGATTCAACAATCGCTGATATTGCAGTTGCAACTAACGCAGGACAAATCAAGACTGGTTCACTTTC 

ACGTACAGACCGCATCGCTAAATACAACCAATTGCTTCGTATCGAAGACCAACTTGGTGAAGTAGCTGA 

ATATCGTGGATTGAAATCATTCTACAACCTTAAAAAA 

SP105 amino acid (SEQ ID NO: 184) 

DYLEIPLYSYLGGFNTKVLPTPMMNIINGGSHSDAPIAFQEFMILPVGAPTFKEALRYGAEIFHALKKI 

LKSRGLETAVGDEGGFAPRFEGTEDGVETILAAIEAAGYVPGKDVFIGFDCASSEFYDKERKVYDYTKF 

EGEGAAVRTSAEQIDYLEELVNKYPIITIEDGMDENDWDGWKALTERLGKXVQLVGDDFFVTNTDYLAR 

GIQEGAANSILIKVNQIGTLTETFEAIEMAKEAGYTAWSHRSGETEDSTIADIAVATNAGQIKTGSLS 

RTDRIAKYNQLLRIEDQLGEVAEYRGLKSFYNLKK 

SP106 nucleotide (SEQ ID NO:185) 

TCGTATCTTTTTTTGGAGCAATGTTCGCGTAGAAGGACATTCCATGGATCCGACCCTAGCGGATGGCGA 

AATTCTCTTCGTTGTAAAACACCTTCCTATTGACCGTTTTGATATCGTGGTGGCCCATGAGGAAGATGG 

CAATAAGGACATCGTCAAGCGCGTGATTGGAATGCCTGGCGACACCATTCGTTACGAAAATGATAAACT 

CTACATCAATGACAAAGAAACGGACGAGCCTTATCTAGCAGACTATATCAAACGCTTCAAGGATGACAA 

ACTCCAAAGCACTTACTCAGGCAAGGGCTTTGAAGGAAATAAAGGAACTTTCTTTAGAAGTATCGCTCA 

AAAAGCTCAAGCCTTCACAGTTGATGTCAACTACAACACCAACTTTAGCTTTACTGTTCCAGAAGGAGA 

ATACCTTCTCCTCGGAGATGACCGCTTGGTTTCGAGCGACAGCCGCCACGTAGGTACCTTCAAAGCAAA 

AGATATCACAGGGGAAGCTAAATTCCGCTTATGGCCAATCACCCGTATCGGAACATTT 

SP106 amino acid (SEQ ID NO: 186) 

RIFFWSNVRVEGHSMDPTLADGEILFWKHLPIDRFDIWAHEEDGNKDIVKRVIGMPGDTIRYENDKL 

YINDKETDEPYLADYIKRFKDDKLQSTYSGXGFEGNKGTFFRSIAQKAQAFTVDVNYNTNFSFTVPEGE 

YLLLGDDRLVSSDSRHVGTFXAKDITGEAKFRLWPITRIGTF 

SP107 nucleotide (SEQ ID NO:187) 

GGACTCTCTCAAAGATGTGAAAGCAAATGCTAGCGACAGCAAGCCTGCACAGGACAAGAAGGATGCAAA 

ACAAGGAACGGAAGATAGTAAGGATTCAGATAAGATGACTGAAACAAACTCAGTTCCGGCAGGAGTGAT 

TGTGGTCAGTCTACTTGCCCTCCTAGGCGTGATTGCCTTCTGGCTGATTCGCCGTAAGAAAGAGTCAGA 

AATCCAGCAATTAAGCACGGAATTGATCAAGGTTCTAGGACAGCTAGATGCAGAAAAAGCGGATAAAAA 

AGTCCTTGCCAAAGCCCAAAACCTTCTCCAAGAAACCCTTGATTTCGTGAAAGAAGAAAATGGCTCAGC 

AGAGACAGAAACTAAACTAGTAGAGGAGCTTAAAGCAATCCTTGACAAACTCAAG 

SP107 amino acid (SEQ ID NO: 188) 

DSLKDVKANASDSKPAQDKKDAXQGTEDSKDSDKMTETNSVPAGVIVVSLLALLGVIAFWLIRRXKESE 

IQQLSTELIKVLGQLDAEKADKKVLAKAQNLLQETLDFVKEENGSAETETKLVEELKAILDKLK 

SP108 nucleotide (SEQ ID NO: 189) 

CAAGAAATCCTATCATCTCTTCCAGAAGCAAACAGAGACGAGGGGAATTCAGACTCAGTTGATTGAAGA 

ATCGCTTAGTCAGCAGACTATAATCCAGTCCTTCAATGCTCAAACAGAATTTATCCAAAGATTGCGTGA 

GGCTCATGACAACTACTCAGGCTATTCTCAGTCAGCCATCTTTTATTCTTCAACGGTCAATCCTTCGAC 

TCGCTTTGTAAATGCACTCATTTATGCCCTTTTAGCTGGAGTAGGAGCTTATCGTATCATGATGGGTTC 

AGCCTTGACCGTCGGTCGTTTAGTGACTTTTTTGAACTATGTTCAGCAATACACCAAGCCCTTTAACGA 

TATTTCTTCAGTGCTAGCTGAGTTGCAAAGTGCTCTGGCTTGCGTAGAGCGTATCTATGGAGTCTTAGA 

TAGCCCTGAAGTGGCTGAAACAGGTAAGGAAGTCTTGACGACCAGTGACCAAGTTAAGGGAGCTATTTC 

CTTTAAACATGTCTCTTTTGGCTACCATCCTGAAAAAATTTTGATTAAGGACTTGTCTATCGATATTCC 

AGCTGGTAGTAAGGTAGCCATCGTTGGTCCGACAGGTGCTGGAAAATCAACTCTTATCAATCTCCTTAT 

GCGTTTTTATCCCATTAGCTCGGGAGATATCTTGCTGGATGGGCAATCCATTTATGATTATACACGAGT 

ATCATTGAGACAGCAGTTTGGTATGGTGCTTCAAGAAACCTGGCTCACACAAGGGACCATTCATGATAA 

TATTGCCTTTGGCAATCCTGAAGCCAGTCGAGAGCAAGTAATTGCTGCTGCCAAAGCAGCTAATGCAGA 

CTTTTTCATCCAACAGTTGCCACAGGGATACGATACCAAGTTGGAAAATGCTGGAGAATCTCTCTCTGT 

CGGCCAAGCTCAGCTCTTGACCATAGCCCGAGTCTTTCTGGCTATTCCAAAGATTCTTATCTTAGACGA 

GGCAACTTCTTCCATTGATACACGGACAGAAGTGCTGGTACAGGATGCC'TTTGCAAAACTCATGAAGGG 

CCGCACAAGTTTCATCATTGCTCACCGTTTGTCAACCA.TTCAGGATGCGGATTTAATTCTTGTCTTAGT 
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AGATGGTGATATTGTTGAATATGGTAACCATCAAGAACTCATGGATAGAAAGGGTAAGTATTACCAAAT 
GCAAAAAGCTGCGGCTTTTAGTTCTGA 
A ' 

SP108 amino acid (SEQ ID NO: 190) 

KKSYHLFQKQTETRGIQTQLIEESLSQQTIIQSFNAQTEFIQRLREAHDNYSGYSQSAIFYSSTVNPST 
RFVNALIYALLAGVGAYRIMMGSALTVGRLVTFLNYVQQYTKPFNDISSVLAELQSALACVERIYGVLD 
SPEVAETGKEVLTTSDQVKGAISFKHVSFGYHPEKILIKDLSXDIPAGSKVAIVGPTGAGKSTLINLLM 
RFYPISSGDILLDGQSIYDYTRVSLRQQFGMVLQETWLTQGTIHDNIAFGNPEASREQVIAAAKAANAD 
FFIQQLPQGYDTKLENAGESLSVGQAQLLTIARVFLAIPKILILDEATSSXDTRTEVLVQDAFAKLMKG 
RTSF 1 1 AHRL STI QDADL I LVLVDGD IVEY GNHQELMDRKGKYY QMQKAAAF S S E 

SP109 nucleotide (SEQ ID NO: 191) 

ACGAAATGCAGGGCAGACAGATGCCTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGT 

GAAAAAAACAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGG 

AGTGGAGGAATATTTCTCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGG 

AGAAACAACCAAGTACGAATTGATTAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGC 

CAAGCAAATTTCTCTCAAGGAAATCCTGCTTCACTATTTCCGCATTATCAATCCAACCAGCAAAAATAA 

ACAAGGAAATGATGTGGGGACCCAGTACCGTACTGGTGTTTATTACACAGATGACAAGGATTTGGAAGT 

GATTAACCAAGTCTTTGATGAGGTGGCTAAGAAATACGATCAACCTCTAGCAGTTGAAAAGGAAAACTT 

GAAGAATTTTGTGGTGGCTGAGGATTACCATCAAGACTATCTCAAGAAAAATCCAAATGGCTACTGCCA 

TATCAATGTTAATCAGGCGGCCTATCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGATGAGGA 

ATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTC 

AAACCGTTACTGGGATAAATTTGAATCGGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTC 

ATCAAAAGACAAATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGT 

CACCTACAAGGAAGATAAGTCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCA 

CCTTGGGCATGTCTTTACGGATGGTCCACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTC 

TATCCGCTTTATTCCCAAAGACCAAATGGAAGAAAAAGGCTACGCTTATTTACTAGATTATGTTGAT 

SP109 amino acid (SEQ ID NO: 192) 

RNAGQTDASQIEKAAVSQGGKAVKKTEISKDADLHEIYLAGGCFWGVEEYFSRVPGVTDAVSGYANGRG 
ETTKY E L I N QTGHAETVHVTYDAKQ I S LKE I LLH YF RI I N PT S KNKQGNDVGTQ YRTG V fYT DDKDL EV 
INQVFDEVAKKYDQPLAVEKENLKNFWAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSDEE 
LKKTLSPEEYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKDKFESGCGWPSFTQPISPDVV 
TYKEDKSYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGYAYLLDYVD 

SP110 nucleotide (SEQ ID NO:193) 

TGTATAGTTTTTAGCGCTTGTTCTTCTAATTCTGNTAAAAATGAAGAAAATACTTCTAAAGAGCATGCG 

CCTGATAAAATAGTTTTAGATCATGCTTTCGGTCAAACTATATTAGATAAAAAACCTGAAAGAGTTGCA 

ACTATTGCTTGGGGAAATCATGATGTAGCATTAGCTTTAGGAATAGTTCCTGTTGGATTTTCAAAAGCA 

AATTACGGTGTAAGTGCTGATAAAGGAGTTTTACCATGGACAGAAGAAAAAATCAAAGAACTAAATGGT 

AA^GCTAACCTATTTGACGATTTGGATGGACTTAACTTTGAAGCAATATCAAATTCTAAACCAGATGTT 

ATCTTAGCAGGTTATTCTGGTATAACTAAAGAAGATTATGACACTCTATCA 

SP110 amino acid (SEQ ID NO: 194) 

CIVFSACSSNSXKNEENTSKEHAPDKIVLDHAFGQTILDKKPERVATIAWGNHDVALALGIVPVGFSKA 

NYGVSADKGVLPWTEEKIKELNGKANLFDDLDGLNFEAISNSKPDVILAGYSGITKEDYDTLS 

SP111 nucleotide (SEQ ID NO: 195) 

GTGTGTCGAGCATATTCTGAAGCAAACCTATCAAAATATAGAAATTATTTTAGTTGATGACGGTTCTAC 

GGATAATTCTGGGGAAATTTGTGATGCTTTTATGATGCAAGATAATCGTGTGCGAGTATTGCATCAAGA 

AAATAAGGGGGGGGCAGCACAAGCTAAAAATATGGGGATTAGTGTAGCTAAGGGAGAGTACATCACGAT 

TGTTGATTCAGATGATATCGTAAAAGAAAATATGATTGAAACTCTTTATCAGCAAGTCCAAGAAAAGGA 

TGCAGATGTTGTTATAGGGAATTACTATAATTATGACGAAAGTGACGGGAATTTTTATTTTTATGTAAC 

AGGGCAAGATTTTTGCGTCGAAGAATTAGCTATACAAGAAATTATGAACCGTCAAGCAGGAGATTGGAA 

ATTCAATAGCTCGGCCTTTATATTGCCGACATTTAAGTTGATTAAAAAAGAATTATTCAATGAAGTTCA 

CTTTTCAAATGGTCGCCGCTTTGATGATGAAGCAACTATGCATCGCTTTTATCTTTTAGCCTCTAAAAT 

CGTCTTTATAAACGATAATCTCTATCTGTATAGAAGACGTTCAGGAAGCATCATGAGAACGGAATTTGA 
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TCTTTCCTGGGCAAGAGATATTGTTGAAGTGTTTTCTAAGAAAATATCGGATTGTGTCTTGGCTGGTTT 

GGATGTCTCCGTTCTGCGTATTCGATTTGTCAATCTTTTAAAAGATTATAAGCAAACTTTAGAATACCA 

TCAATTAACAGATACTGAGGAATATAAAGATATTTGTTTCAGATTAAAGTTGTTTTTTGATGCAGAACA 

AAGAAATGGTAAAAGT 

SP111 amino acid (SEQ ID NO: 196) 

CVEH I L KQT Y Q N I E 1 1 L VDDG S TDNSG E I C D AFMMQDNR VR VL HQ ENKGG AAQ AKNMG I S VAKG E Y I T I 
VDSDDIVKENMlETLYQQVQEKDADWIGNYYNYDESDGNFYFyVTGQDFCVEELAIQEIMNRQAGDWK 
FNSSAFILPTFKLIKKELFNEVHFSNGRRFDDEATMHRFYLLASKIVFINDNLYLYRRRSGSIMRTEFD 
LSWARDIVEVFSKKISDCVLAGLDVSVLRIRFVNLLKDYKQTLEYHQLTDTEEYKDICFRLKLFFDAEQ 
RNGKS 

SPO 112 nucleotide (SEQ ID NO:197) 

GTGTTTGGATAGCATTCAGAATCAGACGTATCAAAATTTTGAGTGTTTATTAATCAATGATGGCTCTCC 
AGATCATTC ATCC AAAATATGTGAAG AATTTGT AGAGAAAGATTC TCG TTTC AAATATTTTG AGAAAGC 
AAACGGCGGTCTTTCATCAGCTCGTAACCTAGGTATTGAATGTTCGGGGGGGGCGTACATTACTTTTGT 
AGACTCTGATGATTGGTTGGAACATGATGCTTTAGACCGATTATATGGTGCTTTGAAAAAGGAAAACGC 
AGATATTAGTATCGGGCGTTATAATTCTTATGATGAAACACGCTATGTGTATATGACTTATGTTACGGA 
TCCAGATGATTCTCTAGAAGTGATAGAAGGTAAAGCAATTATGGATAGGGAAGGTGTCGAAGAAGTCAG 
AAATGGGAACTGGACTGTAGCTGTCTTGAAGTTATTCAAGAGAGAGTTACTACAAGATTTACCATTTCC 
TATAGGAAAAATTGCAGAGGATACTTACTGGACATGGAAGGTACTTCTAAGAGCTTCGAGGATAGTCTA 
TTTGAATCGTTGTGTTTACTGGTACCGTGTTGGTTTATCTGATACTTTATCGAATACATGGAGTGAAAA 
GCGTATGTATGATGAAATTGGGGCTAGGGAAGAAAAGATAGCTATTTTAGCAAGTTCAGACTATGACTT 
GACCAATCATATTTTGATTTATAAAAATAGATTACAAAGAGTGATAGCAAlAATTAGAAGAACAAAATAT 
GCAGTTCACAGAGATJTACAGAAGAATGATGGAAAAATTGTCTTTACTTCCG 

SPO 112 amino acid (SEQ ID NO:198> 

CLDSIQNQTYQNFECLLINDGSPDHSSKICEEFVEKDSRFKYFEKANGGLSSARNLGIECSGGAYITFV 

DSDDWLEHDALDRLYGALKKENADISIGRYNSYDETRYVYMTYVTDPDDSLEVIEGKAIMDREGVEEVR 

NGNWTVAVLKLFKRELLQDLPFPIGKIAEDTYWTWKVLLRASRIVYLNRCVYWYRVGLSDTLSNTWSEK 

RHYDEIGAREEKIAILASSDYDLTNHILIYKNRLQRVIAKLEEQNMQFTEIYRRMMEKESLL? 

SP113 nucleotide (SEQ ID NO: 199) 

GTGCCTAGATAGTATTATTACTCAAACATATAAAAATATTGAGATTGTTGTCGTTAATGATGGTTCTAC 

GGATGCTTCAGGTGAAATTTGTAAAGAATTTTCAGAAATGGATCACCGAATTCTCTATATAGAACAAGA 

AAATGCTGGTCTTTCTGCCGCACGAAACACCGGTCTGAATAATATGTCCGGAAATTATGTGACCTTTGT 

GGACTCGGATGATTGGATTGAGCAAGATTATGTAGAAACTCTATATAAAAAAATAGTAGAGTATCAGGC 

TGATATTGCAGTTGGTAATTATTATTCTTTCAACGAAAGTGAAGGAATGTTCTACTTTCATATATTGGG 

AGACTCCTATTATGAGAAAGTATATGATAATGTTTCTATCTTTGAGAACTTGTATGAAACTCAAGAAAT 

GAAGAGTTTTGCTTTGATATCTGCTTGGGGTAAACTCTATAAGGCAAGATTGTTTGAGCAGTTGCGCTT 

TGACATAGGTAAATTAGGAGAAGATGGTTACCTCAATCAAAAGGTATATTTATTATCAGAAAAGGTAAT 

TTATTTAAATAAAAGTCTTTATGCTTATCGGATTAGAAAAGGTAGTTTATCAAGAGTTTGGACAGAAAA 

GTGGATGCACGCTTTAGTTGATGCTATGTCTGAACGTATTAOGCTACTAGCTAATATGGGTTATCCTCT 

AGAGAAACACTTGGCAGTTTATCGTCAGATGTTGGAAGTCAGTCTCGCCAACGGTCAAGCTAGTGGTTT 

ATCTGACACAGCAACGTATAAAGAGTTTGAAATGAAACAAAGGCTTTTAAATCAGCTATCGAGACAAGA 

GGAAAGTGAAAAGAAAGCCATTGTCCTCGCAGCAAACTATGGCTATGTAGACCAAGTTTTAACGACAAT 

CAAGTCTATTTGTTATCATAATCGTTCGATTCGTTTTTATCTGATTCATAGCGATTTTCCAAATGAATG 

GATTAAGCAATTAAATAAGCGCTTAGAGAAGTTTGACTCAGAAATTATTAATTGTCGGGTAACTTCTGA 

GCAAATTTCATGTTATAAATCGGATATTAGTTACACAGTCTTTTTACGCTATTTCATAGCTGATTTCGT 

GCAAGAAGACAAGGCCCTCTACTTGGACTGTGATCTAGTTGTAACGAAAAATCTGGATGACTTGTTTGC 

TACAGACTTACAAGATTATCCTTTGGCTGCTGTTAGAGATTTTGGGGGCAGAGCTTATTTTGGTCAAGA 

AATCTTTAATGCCGGTGTTCTCTTGGTAAACAATGCTTTTTGGAAAAAAGAGAATATGACCCAAAAATT 

AATTGATGTAACCAATGAATGGCATGATAAGGTGGATCAGGCAGATCAGAGCATCTTGAATATGCTTTT 

TGAACATAAATGGTTGGAATTGGACTTTGATTATAATCATATTGTCATTCATAAACAGTTTGCTGATTA 

TCAATTGCCTGAGGGTCAGGATTATCCTGCTATTATTCACTATCTTTCTCATCGGAAACCGTGGAAAGA 

TTTGGCGGCCCAAACCTATCGTGAAGTTTGGTGGTACTATCATGGGCTTGAATGGACAGAATTGGGACA 

AAACCATCATTTACATCCATTACAAAGATCTCACATCTATCCAATAAAGGAACCTTTCACTTGTCTAAT 

CTATACTGCCTCAGACCATATTGAACAAATTGAGACATTGGTTCAATCCTTGCCTGATATTCAGTTTAA 
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GATAGCAGCTAGAGTAATAGTTAGTGATCGATTGGCTCAGATGACAATTTATCCAAACGTGACTATATT 

TAACGGAATTCACTATTTGGTAGATGTCGATAATGAATTGGTAGAAACCAGTCAAGTACTTTTAGATAT 

TAATCATGGCGAAAAGACAGAAGAAATTCTCGATCAATTTGCTAATCTTGGCAAGCCTATCTTATCCTT 

TGAAAATACTAAAACCTATGAAGTAGGTCAGGAGGCATATGCTGTTGACCAAGTTCAAGCAATGATTGA 

AAAATTGAGAGAAATAAGCAAA 

SP113 amino acid (SEQ ID NO: 200) 

CLDSIITQTYKNI2IWVNDGSTDASGEICKEFSEMDHRILYIEQENAGLSAARNTGLNNMSGNYVTFV 
DSDDWI EQDYVETLYKKI VE YQAD I AVGNYY SFNES EGMFYFH I LGDS YYEKVYDNV S I F ENLYETQEM 
KSFALISAWGKLYKARLFEQLRFDIGKLGEDGYLNQKVYLLSEKVIYLNKSLYAYHIRKGSLSRVWTEK 
WMHALVDAMSERITLLANMGYPLEKHIAVYRQMLEVSLANGQASGLSDTATYKEFEMKQKLLNQLSRQE 
ESEKKAIVLAANYGYVDQVLTTIKSICYHNRSIRFYLIHSDFPNEWIKQLNKRLEKFDSEIINCRVTSE 
QISCYKSDISYTVFLRYFIADFVQEDKALYLDCDLWTKNLDDLFATDLQDYPLAAVRDFGGRAYFGQE 
IFNAGVLLVNNAFWKKENMTQKLIDVTNEWHDKVDQADQSILNMLFEHKWLELDFDYNKIVIHKQFADY 
QLPEGQDYPAIIHYLSHRKPWKDLAAQTYREVWWYYHGLEWTELGQNHHLHPLQRSHIYPIKEPFTCLI 
YTASDHIEQIETLVQSLPDIQFKIAARVIVSDRLAQMTIYPNVTIFNGIHYLVDVDNELVETSQVLLDI 
NHGEKTEEILDQFANLGKPILSr ENTKTYEVGQEAYAVDQVQAMIEKLREISK 

SP114 nucleotide (SEQ ID NO:201) 

CATTCAGAAGCAGACCTATCAAAATCTGGAAATTATTCTTGTTGATGATGGTGCAACAGATGAAAGTGG 

TCGCTTGTGTGATTCAATCGCTGAACAAGATGACAGGGTGTCAGTGCTTCATAAAAAGAACGAAGGATT 

GTCGCAAGCACGAAATGATGGGATGAAGCAGGCTCACGGGGATTATCTGATTTTTATTGACTCAGATGA 

TTATATCCATCCAGAAATGATTCAGAGCTTATATGAGCAATTAGTTCAAGAAGATGCGGATGTTTCGAG 

CTGTGGTGTCATGAATGTCTATGCTAATGATGAAAGCCCACAGTCAGCCAATCAGGATGACTATTTTGT 

CTGTGATTCTCAAACATTTCTAAAGGAATACCTCATAGGTGAAAAAATACCTGGGACGATTTGCAATAA 

GCTAATCAAGAGACAGATTGCAACTGCCCTATCCTTTCCTAAGGGGTTGATTTACGAAGATGCCTATTA 

CCATTTTGATTTAATCAAGTTGGCCAAGAAGTATGTGGTTAATACTAAACCCTATTATTACTATTTCCA 

TAGAGGGGATAGTATTACGACCAAACCCTATGCAGAGAAGGATTTAGCCTATATTGATATCTACCAAAA 

GTTTTATAATGAAGTTGTGAAAAACTATCCTGACTTGAAAGAGGTCGCTTTTTTCAGATTGGCCTATGC 

CCACTTCTTTATTCTGGATAAGATGTTGCTAGATGATCAGTATAAACAGTTTGAAGCCTATTCTCAGAT 

TCATCGTTTTTTAAAAGGCCATGCCTTTGCTATTTCTAGGAATCCAATTTTCCGTAAGGGGAGAAGAAT 

TAGTGCTTTGGCCCTATTCATAAATATTTCCTTATATCGATTCTTATTACTGAAAAATATTGAAAAATC 

TAAAAAATTACAT 

SP 114 amino acid (SEQ ID NO: 202) 

IQKQTYQNLEIILVDDGATDESGRLCDSIAZQDDRVSVLHKKNEGLSQARNDGMKQAHG3YLIFIDSDD 

YIHPEMIQSLYEQLVQEDADVSSCGVMNVYANDESPQSANQDDYFVCDSQTFLKEYLIGEKIPGTICNK 

LIKRQIATALSFPKGLXYEDAYYHFDLIKLAKKYWNTKPYYYYFHRGDSITTKPYAEKDLAYIDIYQK 

FYNEWKNYPDLKEVArFRLAYAHFFILDKMLLDDQYKQFEAYSQIHRFLKGHAFAISRNPIFRKGRRI 

SALALFINISLYRFLLLKNIEKSKKLH 

SP115 nucleotide (SEQ ID NO; 203) 

TAAGGCTGATAATCGTGTTCAAATGAGAACGACGATTAATAATGAATCGCCATTGTTGCTTTCTCCGTT 
GTATGGCAATGATAATGGTAACGGATTATGGTGGGGGAACACATTGAAGGGAGCATGGGAAGCTATTCC 
TGAAG ATGTAAAGC C ATATGC AGC GATTGAACTTC ATCCTGC AAAAGTCTGT AAACC AAC AAGTTGTAT 
TCCACGAGATACGAAAGAATTGAGAGAATGGTATGTCAAGATGTTGGAGGAAGCTCAAAGTCTAAACAT 
TCCAGTTTTCTTGGTTATTATGTCGGCTGGAGAGCGTAATACAGTTCCTCCAGAGTGGTTAGATGAACA 
ATTCCAAAAGTATAGTGTGTTAAAAGGTGTTTTAAATATTGAGAATTATTGGATTTACAATAACCAGTT 
AGCTCCGCATAGTGCTAAATATTTGGAAGTTTGTGCCAAATATGGAGCGCATTTTATCTGGCATGATCA 
TGAAAAATGGTTCTGGGAAACTATTATGAATGATCCGACATTCTTTGAAGCGAGTCAAAAATATCATAA 
AAATTTGGTGTTGGCAACTAAAAATACGCCAATAAGAGATGATGCGGGTACAGATTCTATCGTTAGTGG 
ATTTTGGTTGAGTGGCTTATGTGATAACTGGGGCTCATCAACAGATACATGGAAATGGTGGGAAAAACA 
TTATACAAACACATTTGAAACTGGAAGAGCTAGGGATATGAGATCCTATGCATCGGAACCAGAATCAAT 
GATTGCTATGGAAATGATGAATGTATATACTGGGGGAGGCACAGTTTATAATTTCGAATGTGCCGCGTA 
TACATTTATGACAAATGATGTACCAACTCCAGCATTTACTAAAGGTATTATTCCTTTCTTTAGACATGC 
TATACAAAATCCAGCTCCAAGTAAGGAAGAAGTTGTAAATAGAACAAAAGCTGTATTTTGGAATGGAGA 
AGGTAGGATTAGTTCATTAAACGGATTTTATCAAGGACTTTATTCGAATGATGAAACAATGCCTTTATA 
TAATAATGGGAGATATCATATTCTTCCTGTAATACATGAGAAAATTGATAAGGAAAAGATTTCATCTAT 
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ATTCCCTAATGCAAAAATTTTGACTAAAAATAGTGAGGAATTGTCTAGTAAAGTCAACTATTTAAACTC 
GCTTTATCCAAAACTTTATGAAGGAGATGGGTATGCTCAGCGTGTAGGTAATTCCTGGTATATTTATAA 
TAGTAATGCTAATATCAATAAAAATCAGCAAGTAATGTTGCCTATGTATACTAATAATACAAAGTCGTT 
ATCGTTAGATTTGACGCCACATACTTACGCTGTTGTTAAAGAAAATCCAAATAATTTACATATTTTATT 
GAATAATTACAGGACAGATAAGACAGCTATGTGGGCATTATCAGGAAATTTTGATGCATCAAAAAGTTG 
G AAG AAAG AAGAATT AGAGTT AG C GAAC TGG ATAAGC AAAAA TT ATT C C AT C AATC C TGT AG AT AATGA 
CTTTAGGACAACAACACTTACATTAAAAGGGCATACTGGTCATAAACCTCAGATAAATATAAGTGGCGA 
TAAAAATCATTATACTTATACAGAAAATTGGGATGAGAATACCCATGTTTATACCATTACGGTTAATCA 
TAATGGAATGGTAGAGATGTCTATAAATACTGAGGGGACAGGTCCAGTCTCTTTCCCAACACCAGATAA 
ATTTAATGATGGTAATTTGAATATAGCATATGCAAAACCAACAACACAAAGTTCTGTAGATTACAATGG 
AGACCCTAATAGAGCTGTGGATGGTAACAGAAATGGTAATTTTAACTCTGGTTCGGTAACACACACTAG 
GGCAGATAATCCCTCTTGGTGGGAAGTCGATTTGAAAAAAATGGA.TAAAGTTGGGCTTGTTAAAATTTA 
TAATCGCACAGATGCTGAGACTCAACGTCTATCTAATTTT 

SP115 amino acid (SEQ ID NO: 204) 

KADNRVQMRTTINNES PLLLSPLYGNDNGNGLWWGNTLKGAWEAX PEDVKPYAAX ELHPAKVCKPT SC I 
PRDTKELREWYVKMLEEAQSLNIPVFLVIMSAGERNTVPPEWLDEQFQKYSVLKGVI^IENYWIYNNQL 
APHSAKYLEVCAKYGAHFIWHDHEKWFWETIMNDPTFFEASQKYHKNLVLATKNTPIRDDAGTDSIVSG 
FWLSGLCDNWGSSTDTWKWWEKHYTNTFETGRARDMRSYASEPESMIAMEMMNVYTGGG'T/YNFECAAY 
TFHTNDVPTPAFTKGIIPFFRHAIQNPAPSKEEWNRTKAVFWNGEGRISSLNGFYQGLYSNDETMPLY 
NNGRYHILPVIHEKXDKEKISSIFPNAKIETKNSEELSSKVNYLNSLYPKLYEGDGYAQRVGNSWYIYN 
SNANINKNQQVMLPMYTNNTKSLSLDLTPHTYAWKENPNNLHILLNNYRTDKTAMWALSGNFDASKSW 
KKEELELAFTWISKNYSINPVDNDFRTTTLTLKGHTGHKPQINISGDKNHYTYTENWDENTmr/TITVNH 
NGMVEMSINTEGTGPVSFPTPDKFNDGNLNIAYAKPTTQSSVDYNGDPNRAVDGNRNGNFNSGSVTHTR 
ADNPSWWEVDLKKMDKVGLVKZYNRTDAETQRLSNF 

SP 117 nucleotide {SEQ ID NO:205) 

CTGTGGCAATCAGTCAGCTGCTTCCAAACAGTCAGCTTCAGGAACGATTGAGGTGATTTCACGAGAAAA 
TGGCTCTGGGACACGGGGTGCCTTCACAGAAATCACAGGGATTCTCAAAAAAGACGGTGATAAAAAAAT 
TGACAACACTGCCAAAACAGCTGTGATTCAAAATAGTACAGAAGGTGTTCTCTCAGCAGTTCAAGGGAA 
TGCTAATGCTATCGGCTACATCTCCTTGGGATCTTTAACGAAATCTGTCAAGGCTTTAGAGATTGATGG 
TGTC AAGG C T AGT C G AG AC AC AGTT TT AG ATGGTG AAT AC C C T CT T C AAC GTC C C TTC AAC ATTGTTTG 
GTCTTCTAATCTTTCCAAGCTAGGTCAAGATTTTATCAGCTTTATCCACTCCAAACAAGGTCAACAAGT 
GGTCACAGATAATAAATTTATTGAAGCTAAAACCGAAACCACGGAATATACAAGCCAACACTTATCAGG 
CAAGTTGTCTGTTGTAGGTTCCACTTCAGTATCTTCTTTAATGGAAAAATTAGCAGAAGCTTATAAAAA 
AGAAAATCCAGAAGTTACGATTGATATTACCTCTAATGGGTCTTCAGCAGGTATTACCGCTGTTAAGGA 
GAAAACCGCTGATATTGGTATGGTTTCTAGGGAATTAACTCCTGAAGAAGGTAAGAGTCTCACCCATGA 
TGCTATTGCTTTAGACGGTATTGCTGTTGTGGTCAATAATGACAATAAGGCAAGCCAAGTCAGTATGGC 
TGAACTTGCAGACGTTTTTAGTGGCAAATTAACCACCTGGGACAAGATTAAA 

SP117 amino acid {SEQ ID NO: 206) 

CGNQ S AAS KQ SASGT I EVI SRENG SGTRG AFTE ITG I LKKDGDKK I DNTAKTAVIQN STEGVLS AVQGN 
ANAIGYISLGSLTKSVKALEIDGVKASRDTVLDGEYPLQRPFNIVWSSNLSKLGQDFISFIHSKQGQQV 
VTDNKFIEAKTETTEYTSQHLSGKLSWGSTSVSSLMEKLAEAYKKENPEVTIDITSNGSSAGITAVKE 
KTAD I GMVSRELT P EEGKS LTHDA I ALDG I AVVVNNDNKASQ V SMAELADVF SGKLTTWDKI K 

SP118 nucleotide (SEQ ID NO:207) 

TTGTCAACAACAACATGCTACTTCTGAGGGGACGAATCAAAGGCAAAGCAGTTCAGCGAAAGTTCCATG 

GAAAGCTTCATACACCAACCTAAACAACCAGGTAAGTACAGAAGAGGTCAAATCTCTCTTATCAGCTCA 

CTTGGATCCAAATAGTGTTGATGCATTTTTTAATCTCGTTAATGACTATAATACCATTGTCGGCTCAAC 

TGGCTTATCAGGAGATTTCACTTCCTTTACTCACACCGAATACGATGTTGAGAAAATCAGTCATCTCTG 

GAATCAAAAGAAGGGCGATTTTGTTGGGACCAACTGCCGTATCAATAGTTATTGTCTTTTGAAAAATTC 

AGTCACCATTCCAAAGCTTGAAAAGAATGACCAGTTGCTTTTCCTAGATAATGATGCGATTGATAAAGG 

AAAGGTCTTTGATTCACAAGATAAGGAAGAGTTTGATATTCTATT7TCGAGAGTTCCAACTGAGTCAAC 

TACAGATGTCAAGGTTCACGCTGAAAAGATGGAAGCATTCTTCTCACAAT'TTCAATTCAATGAAAAAGC 

TCGAATGCTGTCTGTAGTCTTGCACGACAATTTGGATGGCGAGTATCTGTTTGTAGGCCACGTTGGGGT 

CTTAGTACCTGCTGATGACGGTTTCTTATTTGTAGAGAAATTGACTTTCGAAGAGCCCTACCAAGCGAT 
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TAAATTTGCTAGTAAGGAAGATTGCTACAAGTATTTGGGCA.CCAAGTATGCGGATTATACAGGCGAGG" 

ACTGGCTAAGC CTTTTATCATGGATAATG ATAAGTGGGTTAAAC TT 


SP118 amino acid (SEQ ID NO:208) 

CQQQHATSEGTNQRQSSSAKVPWKASYTNLNNQVSTEEVKSLLSAHLDPNSVDAFFNLVNDYNTIVG3T 

GLSGDFTSFTHTEYDVEKISKLWNQKKGDFVGTNCRINSYCLLKNSVTIPKLEKNDQLLFLDNDAIDKG 

KVFDSQDKEEFDILFSRVPTESTTDVKVHAEKMEAFFSQFQFNEKARMLSWLHDNLDGEYLFVGHVGV 

LVPADDGFLFVEKLTFEEPYQAIKFASKEDCYKYLGTKYADYTGEGLAKPFIMDNDKWVKL 


SP119 nucleotide (SEQ ID NO:209) 

TTGTTCAGGCAAGTCCGTGACTAGTGAACACCAAACGAAAGATGAAATGAAGACGGAGCAGACAGCTAG 

taaaacaagcgcagctaaagggaaagaggtggctgattttgaattgatgggagtagatggcAagaccta 

CCGTTTATCTGATTACAAGGGCAAGAAAGTCTATCTCAAATTCTGGGCTTCTTGGTGTTCCATCTGTCT 

GGCTAGTCTTCCAGATACGGATGAGATTGCTAAAGAAGCTGGTGATGACTATGTGGTCTTGACAGTAGT 

GTCACCAGGACATAAGGGAGAGCAATCTGAAGCGGACTTTAAGAATTGGTATAAGGGATTGGATTATAA 

AAATCTCCCAGTCCTAGTTGACCCATCAGGCAAACTTTTGGAAACTTATGGTGTCCGTTCTTACCCAAC 

CCAAGCCTTTATAGACAAAGAAGGCAAGCTGGTCAAAACACATCCAGGATTCATGGAAAAAGATGCAAT' 

TTTGCAAACTTTGAAGGAATTAGCC 


SP119 amino acid (SEQ ID NO: 210) 

CSGKSVTSEHQTKDEMKTEQTASKTSAAKGKEVADFELMGVDGKTYRLSDYKGKKVYLKFWASWCSICL 

ASLPDTDEIAKEAGDDYWLTWSPGHKGEQSEADFKNWYKGLDYKNLPVLVDPSGKLLETYGVRSY^ 

QAF I DKEGKLVKTHPGFMEKDAI LQTLKELA 


SP120 nucleotide (SEQ ID NO:211) 

CTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGTGAAAAAAACAGAAATTAGTAAAGA 

CGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATTTCTCACGTGT 

tcccggggtgacggatgccgtttcaggctatgcaaatggtagaggagaaacaaccaagtacgaattgat 

TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCTCTCAAGGAAAT 

CCTGCTTCACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCA 

GTACCGTACTGGTGTTTATTACACAGATGACAAGGATTTGGAAGTGATTAACCAAGTCTTTGATGAGGT 

GGCTAAGAAATACGATCAACCTCTAGCAGTTGAAAAGGAAAACT'rGAAGAATT'TTGTGGTGGCTGAGGA 

TTACCATCAAGACTATCTCAAGAAAAATCCAAATGGCTACTGCCATATCAATGTTAATCAGGCGGCCTA 

TCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGATGAGGAATTGAAAAAGACCCTGTCACCTGA 

GGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTCAAACCGTTACTGGGATAAATTTGA 

ATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTCATCAAAAGACAAATTTGAGTCTGG 

TTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAGTCCTA 

CAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGG 

TCCACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTCTATCCGCTTTATTCCCAAAGACCA 

AATGGAAGAAAAAGGTACGCTTATTTAC 


SP12 0 amino acid (SEQ ID NO: 2X2) 

SQIEKAAVSQGGKAVKKTEISKDADLHEIYLAGGCFWGVEEYFSRVPGVTDAVSGYANGRGETTKYELI 

nqtghaetvhvtydakqislkeillkyfriinptsknkqgndvgtqyrtgvyytddkdlevinqvfdev 

AKKYDQPLAVEKENLKNFWAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSDEELKKTLSPE 

EYAVTQENQTERAFSNRYWDKFESGIYVDIATGEPLFSSKDKFESGCGWPSFTQPISPDWTYKEDKSY 

NMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGTLIY 

SP121 nucleotide (SEQ ID NO:213) 

TTGTCAGTCAGGTTCTAATGGTTCTCAGTCTGCTGTGGATGCTATCAAACAAAAAGGGAAATTAGTTGT 

GGCAACCAGTCCTGACTATGCACCCTTTGAATTTCAATCATTGGTTGATGGAAAGAACCAGGTAGTCGG 

TGCAGACATCGACATGGCTCAGGCTATCGCTGATGAACTTGGGGTTAAGTTGGAAATCTCAAGCATGAG 

ttttgacaatgttttgaccagtcttcaaactggtaaggctgacctagcagttgcaggaattagtgctac 

tgacgagagaaaagaagtctttgatttttcaatcccatactatgaaaacaagattagtttcttggttcg 

taaggctgatgtggaaaaatacaaggatttaactagcctagaaagtgctaatattgcagcccaaaaagg 

gactgttccagaatcaatggtcaaggaacaattgccaaaagttcaattaacttccctaactaatatggg 

tgaagcagtcaatgaattgcaggctggaaaaatagatgctgttcatatggatgagcctgttgcacttag 
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TTATGCTGCTAAAAACGCTGGCTTAGCTGTCGCAACTGTCAGCTTGAAGATGAAGGACGGCGACGCCAA 

TGCC 

SP121 amino acid (SEQ ID NO:2l4) 

CQSGSNGSQSAVDAIKQKGKLWATSPDYAPFEFQSLVDGKNQWGADIDMAQAIADELGVXLEISSMS 

FDNVLTSLQTGKADEiAVAGISATDERKEVFDFSIPYYENKISFLVRKADVEKYKDLTSLESANIAAQKG 

TVPESMVKEQLPKVQLTSLTNMGEAVNELQAGKIDAVHMDEPVALSYAAKNAGLAVATVSLKHKDGDAN 

A 

SP122 nucleotide (SEQ ID NO: 215) 

GGAAACTTCACAGGATTTTAAAGAGAAGAAAACAGCAGTCATTAAGGAAAAAGAAGTTGTTAGTAAAAA 
TCCTGTGATAGACAATAACACTAGCAATGAAGAAGCAAAAATCAAAGAAGAAAATTCCAATAAATCCCA 
AGGAGATTATACGGACTCATTTGTGAATAAAAACACAGAAAATCCCAAAAAAGAAGATAAAGTTGTCTA 
TATTGCTGAATTTAAAGATAAAGAATCTGGAGAAAAAGCAATCAAGGAACTATCCAGTCTTAAGAATAC 
AAAAGTTTTATATACTTATGATAGAATTTTTAACGGTAGTGCCATAGAAACAACTCCAGATAACTTGGA 
CAAAATTAAACAAATAGAAGGTATTTCATCGGTTGAAAGGGCACAAAAAGTCCAACCCATGATGAATCA 
TGCCAGAAAGGAAATTGGAGTTGAGGAAGCTATTGATTACCTAAAGTCTATCAATGCTCCGTTTGGGAA 
AAATTTTGATGGTAGAGGTATGGTCATTTCAAATATCGATACTGGAACAGATTATAGACATAAGGCTAT 
GAGAATCGATGATGATGCCAAAGCCTCAATGAGATTTAAAAAAGAAGACTTAAAAGGCACTGATAAAAA 
TTATTGGTTGAGTGATAAAATCCCTCATGCGTTCAATTATTATAATGGTGGCAAAATCACTGTAGAAAA 
ATATGATGATGGAAGGGATTATTTTGACCCACATGGGATGCATATTGCAGGGATTCTTGCTGGAAATGA 
TACTGAACAAGACATCAAAAACTTTAACGGCATAGATGGAATTGCACCTAATGCACAAATTTTCTCTTA 
CAAAATGTATTCTGACGCAGGATCTGGGTTTGCGGGTGATGAAACAATGTTTCATGCTATTGAAGATTC 
TATCAAACACAACGTTGATGTTGTTTCGGTATCATCTGGTTTTACAGGAACAGGTCTTGTAGGTGAGAA 
ATATTGGCAAGCTAT^CGGGCATTAAGAAAAGCAGGCATTCCAATGGTTGTCGCTACGGGTAACTATGC 
GACTTC TG C TTC AAGTTC TTC ATGGG ATTT AGT AGC AAAT AATC ATCTG AAAATG ACCG A C AC TGG AAA 
TGTAACACGAACTGCAGCACATGAAGATGCGATAGCGGTCGCTTCTGCTAAAAATCAAACAGTTGAGTT 
TGATAAAGTTAACATAGGTGGAGAAAGTTTTAAATACAGAAATATAGGGGCCTTTTTCGATAAGAGTAA 
AATCACAACAAATGAAGATGGAACAAAAGCTCCTAGTAAATTAAAATTTGTATATATAGGCAAGGGGCA 
AGACCAAGATTTGATAGGTTTGGATCTTAGGGGCAAAATTGCAGTAATGGATAGAATTTATACAAAGGA 
TTTAAAAAATGCTTTTAAAAAAGCTATGGATAAGGGTGCACGCGCCATTATGGTTGTAAATACTGTAAA 
TTACTACAATAGAGATAATTGGACAGAGCTTCCAGCTATGGGATATGAAGCGGATGAAGGTACTAAAAG 
TCAAGTGTTTTCAATTTCAGGAGATGATGGTGTAAAGCTATGGAACATGATTAATCCTGATAAAAAAAC 
TGAAGTCAAAAGAAATAATAAAGAAGATTTTAAAGATAAATTGGAGCAATACTATCCAATTGATATGGA 
AAGTTTTAATTCCAACAAACCGAATGTAGGTGACGAAAAAGAGATTGACTTTAAGTTTGCACCTGACAC 
AGACAAAGAACTCTATAAAGAAGATATCATCGTTCCAGCAGGATCTACATCTTGGGGGCCAAGAATAGA 
TTTACTTTTAAAACCCGATGTTTCAGCACCTGGTAAAAATATTAAATCCACGCTTAATGTTATTAATGG 
CAAATCAACTTATGGCTATATGTCAGGAACTAGTATGGCGACTCCAATCGTGGCAGCTTCTACTGTTTT 
GATTAGACCGAAATTAAAGGAAATGCTTGAAAGACCTGTATTGAAAAATCTTAAGGGAGATGACAAAAT 
AGATCTTACAAGTCTTACAAAAATTGCCCTACAAAATACTGCGCGACCTATGATGGATGCAACTTCTTG 
GAAAGAAAAAAGTCAATACTTTGCATCACCTAGACAACAGGGAGCAGGCCTAATTAATGTGGCCAATGC 
TTTGAGAAATGAAGTTGTAGCAACTTTCAAAAACACTGATTCTAAAGGTTTGGTAAACTCATATGGTTC 
CATTTCTCTTAAAGAAATAAAAGGTGATAAAAAATACTTTACAATCAAGCTTCACAATACATCAAACAG 
ACCTTTGACTTTTAAAGTTTCAGCATCAGCGATAACTACAGATTCTCTAACTGACAGATTAAAACTTGA 
TGAAACATATAAAGATGAAAAATCTCCAGATGGTAAGCAAATTGTTCCAGAAATTCACCCAGAAAAAGT 
CAAAGGAGCAAATATCACATTTGAGCATGATACTTTCACTATAGGCGCAAATTCTAGCTTTGATTTGAA 
TGCGGTTATAAATGTTGGAGAGGCCAAAAACAAAAATAAATTTGTAGAATCATTTATTCATTTTGAGTC 
AGTGGAAGCGATGGAAGCTCTAAACTCCAGCGGGAAGAAAATAAACTTCCAACCTTCTTTGTCGATGCC 
TCTAATGGGATTTGCTGGGAATTGGAACCACGAACCAATCCTTGATAAATGGGCTTGGGAAGAAGGGTC 
AAGATCAAAAACACTGGGAGGTTATGATGATGATGGTAAACCGAAAATTCCAGGAACCTTAAATAAGGG 
AATTGGTGGAGAACATGGTATAGATAAATTTAATCCAGCAGGAGTTATACAAAATAGAAAAGATAAAAA 
TACAACATCCCTGGATCAAAATCCAGAATTATTTGCTTTCAATAACGAAGGGATCAACGCTCCATCATC 
AAGTGGTTCTAAGATTGCTAACATTTATCCTTTAGATTCAAATGGAAATCCTCAAGATGCTCAACTTGA 
AAGAGGATTAACACCTTCTCCACTTGTATTAAGAAGTGCAGAAGAAGGATTGATT 

SP122 amino acid <SEQ ID NO: 216) 

ETSQDFKEKKTAVIKEKEWSKNPVIDNNTSNEEAKIKEENSNKSQGDYTDSFVNKNTENPKKEDKWY 

IAEFKDKESGEKAIKELSSLKNTKVLYTYDRIFNGSAIETTPDNLDKIKQIEGISSVERAQKVQPMMNH 
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ARKEIGVEEAIDYLKSINAPFGKNFDGRGMVISNIDTGTDYRHKAMRIDDDAKASMRFKKEDLKGTDKN 
YWLSDKIPHAFNYYNGGKITVEKYDDGRDYFDPHGMHIAGILAGNDTEQDXKNFNGIDGIAPNAQIFSY 
KMYSDAGSGFAGDETMFHAIEDSIKHNVDWSVSSGFTGTGLVGEKYWQAIRALRKAGXPMWATGNYA 
TSASSSSWDLVANNHLKMTDTGNVTRTAAHEDAIAVASAKNQTVEFDKVNIGGESFKYRNXGAFFDKSK 
ITTNEDGTKAPSKLKFVYIGKGQDQDLIGLDLRGKIAVMDRIYTKDLKNAFKKAMDKGARAIMWNTVN 
YYNRDNWTELPAMGYEADEGTKSQVFSISGDDGVKLWNMINPDKKTEVKRNNKEDFKDKLEQYYPIDME 
SFNSNKPNVGDEKEIDFKFAPDTDKELYKEDXIVPAGSTSWGPRIDLLLKPDVSAPGKNIKSTLNVING 
KST YGYMSGTSMAT P I VAASTVL I RPKLKEMLERP VLKNLKGDDKI DLT S LTKI ALQNTAR PMMDATSW 
KEKSQYFASPRQQGAGLINVANALRNEWATFKNTDSKGLVNSYGSISLKEIKGDKKYFTIKLHNTSNR 
PLTFKVSASAITTDSLTDRLKLDETYKDEKSPDGKQIVPEIHPEKVKGANITFEHDTFTIGANSSFDLN 
AVINVGEAKNKNKFVESFXHFESVEAMEALNSSGKKINFQPSLSMPLMGFAGNWNHEPILDKWAWEEGS 
RSKTLGGYDDDGKPKIPGTLNKGIGGEHGIDKFNPAGVIQNRKDKNTTSLDQNPELFAFNNEGINAPSS 
SGSKIANIYPLDSNGNPQDAQLERGLTPSPLVLRSAEEGLI 

SP123 nucleotide (SEQ ID NO: 217) 

TGTGGTCGAAGTTGAGACTCCTCAATCAATAACAAATCAGGAGCAAGCTAGGACAGAAAACCAAGTAGT 

AGAGACAGAGGAAGCTCCAAAAGAAGAAGCACCTAAAACAGAAGAAAGTCCAAAGGAAGAACCAAAATC 

GGAGGTAAAACCTACTGACGACACCCTTCCTAAAGTAGAAGAGGGGAAAGAAGATTCAGCAGAACCAGC 

TCCAGTTGAAGAAGTAGGTGGAGAAGTTGAGTCAAAACCAGAGGAAAAAGTAGCAGTTAAGCCAGAAAG 

TCAACCATCAGACAAACCAGCTGAGGAATCAAAAGTTGAACAAGCAGGTGAACCAGTCGCGCCAAGAGA 

AGACGAAAAGGCACCAGTCGAGCCAGAAAAGCAACCAGAAGCTCCTGAAGAAGAGAAGGCTGTAGAGGA 

AACACCGAAACAAGAAGAGTCAACTCCAGATACCAAGGCTGAAGAAACTGTAGAACCAAAAGAGGAGAC 

TGTTAATCAATCTATTGAACAACCAAAAGTTGAAACGCCTGCTGTAGAAAAACAAACAGAACCAACAGA 

GGAACCAAAAGTTGAACAAGCAGGTGAACCAGTCGCGCCAAGAGAAGACGAACAGGCACCAACGGCACC 

AGTTGAGCCAGAAAAGCAACCAGAAGTTCCTGAAGAAGAGAAGGCTGTAGAGGAAACACCGAAACCAGA 

AGATAAAATAAAGGGTATTGGTACTAAAGAACCAGTTGATAAAAGTGAGTTAAATAATCAAATTGATAA 

AGCTAGTTCAGTTTCTCCTACTGATTATTCTACAGCAAGTTACAATGCTCTTGGACCTGTTTTAGAAAC 

TGCAAAAGGTGTCTATGCTTCAGAGCCTGTAAAACAGCCTGAGGTAAATAGCGAGACAAATAAACTTAA 

AACGGCTATTGACGCTCTAAACGTTGATAAAACTGAATTAAACAATACGATTGCAGATGCAAAAACAAA 

GGTAAAAGAACATTACAGTGATAGAAGTTGGCAAAACCTCCAAACTGAAGTTACAAAGGCTGAAAAAGT 

TGCAGCTAATACAGATGCTAAACAAAGTGAAGTTAACGAAGCTGTTGAAAAATTAACTGCAACTATTGA 

AAAATTGGTTGAATTATCTGAAAAGCCAATATTAACATTGACTAGTACCGATAAGAAAATATTGGAACG 

TGAAGCTGTTGCTAAGTATACTCTAGAAAATCAAAACAAAACAAAAATCAAATCAATCACAGCTGAATT 

GAAAAAAGGAGAAGAAGTTATTAATACTGTAGTCCTTACAGATGACAAGGTAACAACAGAAACTATAAG 

CGCTGCATTTAAGAACCTAGAGTACTACAAAGAATACACCCTATCTACAACTATGATTTACGACAGAGG 

TAACGGTGAAGAAACTGAAACTCTAGAAAATCAAAATATTCAATTAGATCTTAAAAAAGTTGAGCTTAA 

AAATATTAAACGTACAGATTTAATCAAATACGAAAATGGAAAAGAAACTAATGAATCACTGATAACAAC 

TATTCCTGATGATAAGAGCAATTATTATTTAAAAATAACTTCAAATAATCAGAAAACTACATTACTAGC 

tgttaaaaatatagaagaaactacggttaacggaacacctgtatataaagttacagcaatcgcagacaa 

TTTAGTCTCTAGAACTGCTGATAATAAATTTGAAGAAGAA 

SP123 amino acid (SEQ ID NO: 218) 

WEVETPQSITNQEQARTENQWETEEAPKEEAPKTEESPKEEPKSEVKPTDDTLPKVEEGKEDSAEPA 

PVEEVGGEVESKPEEKVAVKPESQPSDKPAEESKVEQAGEPVAPREDEKAPVEPEKQPEAPEEEKAVEE 

TPKQEESTPDTKAEETVEPKEETVNQSIEQPKVETPAVEKQTEPTEEPKVEQAGEPVAPREDEQAPTAP 

VEPEKQPEVPEEEKAVEETPKPEDKIKGIGTKEPVDKSELNNQIDKASSVSPTDYSTASYNALGPVX.ET 

AKGVYASEPVKQPEVNSETNKLKTAIDALNVDKTELNNTIADAKTKVKEHYSDRSWQNLQTEVTKAEKV 

AANTDAKQSEVNEAVEKLTATIEKLVELSEKPILTLTSTDKKILEREAVAKYTLENQNKTKIKSXTAEL 

KKGEEVINTWLTDDKVTTETISAAFKNLEYYKEYTLSTTMIYDRGNGEETETLENQNIQLDLKKVELK 

NXKRTDLIKYENGKETNESLITTIPDDKSNYYLKITSNNQKTTLLAVKNIEETTVNGTPVYKVTAIADN 

LVSRTADNKFEEE 

SP124 amino acid (SEQ ID NO: 219) 

AACACCTGTATATAAAGTTACAGCAATCGCAGACAATTTAGTCTCTAGAACTGCTGATAATAAATTTGA 
AGAAGAATACGTTCACTATATTGAAAAACCTAAAGTCCACGAAGATAATGTATATTATAATTTCAAAGA 
ATTAGTGGAAGCTATTCAAAACGATCCTTCAAAAGAATATCGTCTGGGACAATCAATGAGCGCTAGAAA 
TGTTGTTC CT AATGGAAAATC AT AT ATC AC TAAAGAATTC AC AGGAAAACTTTTAAGTTCTG AAGGAAA 
ACAATTTGCTATTACTGAATTGGAACATCCATTATTTAATGTGATAACAAACGCAACGATAAATAATGT 
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GAATTTTGAAAATGTAGAGATAGAACGTTCTGGTCAAGATAATATTGCATCATTAGCCAATACTATGAA 

AGGTTCTTCAGTTATTACAAATGTCAAAATTACAGGCACACTTTCAGGTCGTAATAATGTTGCTGGATT 

TGTAAATAATATGAATGATGGAACTCGTATTGAAAATGTTGCTTTCTTTGGCAAACTACACTCTACAAG 

TGGAAATGGCTCTCATACAGGGGGAATTGCAGGTACAAACTATAGAGGAATTGTTAGAAAAGCATATGT 

TGATGCTACTATTACAGGAAACAAAACACGCGCCAGCTTGTTAGTTCCTAAAGTAGATTATGGATTAAC 

TCTAGACCATCTTATTGGTACAAAAGCTCTCCTAACTGAGTCGGTTGTAAAAGGTAAAATAGATGTTTC 

AAATCCAGTAGAAGTTGGAGCAATAGCAAGTAAGACTTGGCCTGTAGGTACGGTAAGTAATTCTGTCAG 

CTATGCTAAGATTATCCGTGGAGAGGAGTTATTCGGCTCTAACGACGTTGATGATTCTGATTATGCTAG 

TGCTCATATAAAAGATTTATATGCGGTAGAGGGATATTCGTCAGGTAATAGATCATTTAGGAAATCTAA 

AACATTTACTAAATTAACTAAAGAACAAGCTGATGCTAAAGTTACTACTTTCAATATTACTGCTGATAA 

ATTAGAAAGTGATCTATCTCCTCTTGCAAAACTTAATGAAGAAAAAGCCTATTCTAGTATTCAAGATTA 

TAACGCTGAATATAACCAAGCCTATAAAAATCTTGAAAAATTAATACCATTCTACAATAAAGATTATAT 

TGTATATCAAGGTAATAAATTAAATAAAGAACACCATCTAAATACTAAAGAAGTTCTTTCTGTTACCGC 

GATGAACAACAATGAGTTTATCACAAACCTAGATGAAGCTAATAAAATTATTGTTCACTATGCGGACGG 

TACAAAAGATTACTTTAACTTGTCTTCTAGCAGTGAAGGTTTAAGTAATGTAAAAGAATATACTATAAC 

TGACTTAGGAATTAAATATACACCTAATATCGTTCAAAAAGATAACACTACTCTTGTTAATGATATAAA 

ATCTATTTTAGAATCAGTAGAGCTTCAGTCTCAAACGATGTATCAGCATCTAAATCGATTAGGTGACTA 

TAGAGTTAATGCAATCAAAGATTTATATTTAGAAGAAAGCTTCACAGATGTTAAAGAAAACTTAACAAA 

CCTAATCACAAAATTAGTTCAAAACGAAGAACATCAACTAAATGATTCTCCAGCTGCTCGTCAAATGAT 

TCGTGATAAAGTCGAGAAAAACAAAGCAGCTTTATTACTAGGTTTAACTTACCTAAATCGTTACTATGG 

AGTTAAATTTGGTGATGTTAATATTAAAGAATTAATGCTATTCAAACCAGATTTCTATGGTGAAAAAGT 

TAGCGTATTAGACAGATTAATTGAAATCGGTTCTAAAGAGAACAACATTAAAGGTTCACGTACATTCGA 

CGCATTCGGTCAAGTA 

SP124 amino acid (SEQ ID NO: 220) 

TPVYKVTAIADNLVSRTADNKFEEEYVHYXEKPKVHEDNVYYNFKELVEAIQNDPSKEYRLGQSMSARN 

WPNGKSYITKEFTGKLLSSEGKQFAITELEHPLFNVITNATINNVNFENVEIERSGQDNIASLANTMK 

GSSVITNVKITGTLSGRNNVAGFVNNMNDGTRIENVAFFGKLHSTSGNGSHTGGXAGTNYRGIVRKAYV 

DATITGNKTRASLLVPKVDYGLTLDHLIGTKALLTESWKGKIDVSNPVEVGAIASKTWPVGTVSNSVS 

YAKIIRGEELFGSNDVDDSDYASAHIKDLYAVEGYSSGNRSFRKSKTFTKLTKEQADAKVTTFNITADK 

LESDLSPLAKLNEEKAYSSIQDYNAEYNQAYKNLEKLIPFTOKDYIVYQGNKLNKEHHLNTKEVLSVTA 

MNNNEFITNLDEANKIIVHYADGTKDYFNLSSSSEGLSNVKEYTITDLGIKYTPNIVQKDNTTLVNDIK 

SILESVELQSQTMYQHLNRLGDYRVNAIKDLYLEESFTDVKENLTNLITKLVQNEEHQLNDSPAARQMI 

RDKVEKNKAALLLGLTYLNRYYGVKFGDVNIKELMLFKPDFYGEKVSVLDRLIEIGSKENNIKGSRTFD 

AFGQV 

SP125 nucleotide (SEQ ID NO:221> 

ATTAGACAGATTAATTGAAATCGGTTCTAAAGAGAACAACATTAAAGGTTCACGTACATTCGACGCATT 

CGGTCAAGTATTGGCTAAATATACTAAATCAGGTAATTTAGATGCATTTTTAAATTATAATAGACAATT 

GTTCACAAATATAGACAATATGAACGATTGGTTTATTGATGCTACAGAAGACCATGTCTACATCGCAGA 

ACGCGCTTCTGAGGTCGAAGAAATTAAAAATTCTAAACATCGTGCATTCGATAATTTAAAACGAAGTCA 

CCTTAGAAATACTATACTCCCACTACTGAATATTGATAAAGCACATCTTTATTTAATTTCAAATTATAA 

TGCAATTGCCTTTGGTAGTGCAGAGCGATTAGGTAAAAAATCATTAGAAGATATTAAAGATATCGTTAA 

CAAAGCTGCAGATGGTTATAGAAACTATTATGATTTCTGGTATCGTCTAGCGTCTGATAACGTTAAACA 

ACGACTACTAAGAGATGCTGTTATTCCTATTTGGGAAGGTTATAACGCTCCTGGTGGATGGGTTGAAAA 

ATATGGCCGCTATAATACCGACAAAGTATATACTCCTCTTAGAGAATTCTTTGGTCCTATGGATAAGTA 

TTATAATTATAATGGAACAGGAGCTTATGCTGCTATATATCCTAACTCTGATGATATTAGAACTGATGT 

AAAATATGTTCATTTAGAAATGGTTGGTGAATACGGTATTTCAGTTTACACACATGAAACAACACACGT 

CAACGACCGTGCGATTTACTTAGGTGGCTTTGGACACCGTGAAGGTACTGATGCTGAAGCATATGCTCA 

GGGTATGCTACAAACTCCTGTTACTGGTAGTGGATTTGATGAGTTTGGTTCTTTAGGTATTAATATGGT 

ATTTAAACGCAAAAATGATGGGAATCAGTGGTATATTACAGATCCAAAAACTCTAAAAACACGAGAAGA 

TATTAATAGATATATGAAGGGTTATAATGACACTTTAACTCTTCTTGATGAAATTGAGGCTGAATCTGT 

GATTTCTCAACAAAATAAAGATTTAAATAGTGCATGGTTCAAAAAAATAGATAGAGAATACCGTGATAA 

CAATAAATTAAATCAATGGGATAAAATTCGAAATCTAAGTCAAGAAGAGAAAAATGAATTAAATATTCA 

ATCTGTTAATGATTTAGTTGATCAACAATTAATGACTAATCGCAATCCAGGTAATGGTATCTATAAACC 

CGAAGCAATTAGCTATAACGATCAATCACCTTATGTAGGTGTTAGAATGATGACCGGTATCTACGGAGG 

TAATACTAGTAAAGGTGCTCCTGGAGCTGTTTCATTCAAACATAATGCTTTTAGATTATGGGGTTACTA 

CGGATACGAAAATGGGTTCTTAGGTTATGCTTCAAATAAATATAAACAACAATCTAAAACAGATGGTGA 
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gtctgttctaagtgatgaatat. 


TTATAAGAAACTTCTTCAACAAACAAATAGCTTTAAAACT^cS ^^^^^ 0 " 07 


SP125 


amino acid 


(SEQ id 


RASEVEEIIgShSdnSsHL^^ 

KAADGYRNYYDFVm?IASDNVKORLLRDAVTPT';p^ MYNAIAFGSAERLGKKSL EDIKDIW 

( ™gtgayaaiypnsddi^^^l^geygi™pp^ EKYGRYNTDKVVTPLREFFG pmdky 

svLsDEViiKKis m rrFtrriE E FKSvre™^r^ K ' ,yGYEt 'G FM »'S»Kv:<QQs K SE 

SP126 nucleotide (SEQ id NO: 223) 

CAAGGGATTTGTGn a * * » m . ?“:^ AATCAGGACAAGTG GATGCCGTTATCT-TG2 1 »r,, „ JClI™* 


126 amino acid (SEQ I 

NGNLIT^KSGQW^IFSisKGFVElSp^ SVNDLAQKKVGAQKGSIQETMAKDLL CNSSLVSLPK 

lkesgeldklieeal mPVSKGFVB ^^ 


S P127 nucleotide 

CTGTG AG AATC AAGC TAC. 


(SEQ id NO: 225) 


aga GC caccat tt gactacg^^^gggc^ctgaca^cttt ATCGTCCTTGCTACAGCTGGCGA 

TTACT C GCTrGr i ^™^I GCGGCCAATAACTTG AGTTACACAAAAGAGrG^^^;;;:^ GACT 


TGACCAGATCGCTGGTAAAAi 


GAATCAGAAACACACTGATAATCCCGCTAGaamJ AC< ' GGAAGTTC ' I,AA< ' GC ' rCAATTC ATCAATAACTG 

acaccttgctaacggaga^ 

ccgtggtttagacctctcagtcgttga™^^^ 

aagcgaccaaaaagagtttaaagagcaatttgat^^™ CCAGCAATTATATCA ttttct C 

CH^ 27 8,111110 acid (SEQ ID NO: 22 6) 

dsghyqa^SSe^S^ 

?n^ TDNPATIWFSGEDIGKRILDI ^GEFDFLWD?vsWKTTSf QIAGKTTQEDTGTSNAQFINNW 

sdqkefkeqfdkalkelyqdgtleklsntyu^^ltoqsqlq SWDLPSADSPSNYIIFS 


TCCTTGTCAGCAACAAGAAAAA' 


TCCTTTGACTTCTCT 
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Table 2 

5. pneumoniae Antigenic Epitopes 


SP001 

Lys-1 to Ile-10; Leu-13 to Lys-32; Arg-41 to Ile-51; Ser-85 to Glu-97; 
Ala-159 to His-168 ; Val-309 to Thr-318; Val-341 to Asn-352; Asn-415 to 
Met-430; Phe-454 to Asn-464; Ser-573 to Gly-591; Asn-597 to Thr-641; 
and Asn-644 to Ala-664. 

SP004 

Thr-9 to Thr -24; Ile-29 to Ala-48; Thr-49 to Val-56; Val-286 to Val- 
312; 

Pro-316 to Glu-344; Val-345 to Ile-367; Gln-368 to Val-399; Ser-400 to 
Glu-431; Asn-436 to Ala-457; Ile-467 to Ala-498; and Thr-499 to Glu- 
540. 

SP006 

Glu-1 to Lys -13; Pro-24 to Gly-36; Val-104 to Thr-112; Ala-118 to Asn- 
130; Trp-137 to Ala-146; Ser-151 to Ile-159; Ile-181 to Leu-188; and 
Pro-194 to Tyr-202 . 

SP007 

Gly-1 to Asn-7; Tyr-24 to Gln-34; His-47 to Phe-55; Ser-oO to Ala-67; 
Ala-122 to Leu-129; Leu-221 to Lys-230; Val-236 to Phe-256; and Asp-271 
to Gly-283; and Leu-291 to Asp-297. 

SP00 8 

Leu-4 to Lys-17; Gln-24 to Leu-32; Asp-60 to Ser-66; Ser-70 to Asp-76; 
Ala-276 to Lys-283; Asn-304 to Lys-311; and Thr-429 to Pro-437. 

SP009 

Thr-4 to Glu-11; Leu-50 to Asp-60; Ile-102 to Trp-123; and Ser-138 to 
Ile-157 . 

SP010 

Phe-34 to Gly-41; Asp-44 to Lys-50; Leu-172 to Val-186; Leu-191 to Val- 
198; Ser-202 to Ile-209; and Val-213 to Leu-221. 

SP011 

Asn-2 to Thr-10; Asp-87 to Ala-102; Tyr-125 to Glu-132; Thr-181 to Tyr- 
189; Arg-217 to Thr-232; Asn-257 to Lys-264; Pro-271 to Ser-278; Tyr- 
317 to Ala-325; Glu-327 to Pro-337; and Thr-374 to Val-381. 

SP012 

Gly-1 to Lys-19; Phe-34 to Tyr-41; Leu-109 to Lys-126; and Leu-231 to 
Glu-247 . 

SP013 

Ala-1 to Lys-12; Ile-42 to Pro-53; Leu-138 to Lys-146; Ile-205 to Lys- 
217; Ser-235 to Ile-251; and Ser-261 to Tyr-272. 

SP014 

Gly-1 to Val-16; Leu-35 to Leu-44; Asp-73 to Asp-81; Ile-83 to Asp-92; 
Glu-145 to Ile-153 ; Phe-188 to Asn-196; Ser-208 to Phe-215; Ile-224 to 
Leu-231; and Asn-235 to Ala-243. 

SP015 

Ser-1 to Pro-16; Asn-78 to Glu-88; Ala-100 to Val-108; Ala-122 to Thr- 
129; Thr- 131 to Ser-137; Leu-201 to Ser-220; and Gly-242 to Val-251. 


BNSDOCID <WO 9B1S930A2 I > 



WO 98/18930 


PCT/US97/19422 


3NSDOCID; <WO 


Table 2 

s. pneumoniae Antigenic Epitopes 

SP016 

182; Gly-189 to' Arg-l^; ^-2 0^0^1-22^° Pro^SS toT^^ 0 
J27 to Ala-342; and Ser-391 to Ala-402. ' 1 

Z:? 1S '° »-*■ <«-» - 

SP019 

L al s!r-250 G1U ' 117; ASP ' 163 t0 LeU - 169; Th - 182 - ^r-191; and Lys - 24 1 

SP020 

UO^Pro 0 !^ 3 ’ 25 ^^' 47 t0 G1U " 60; Tr P’ 75 to Val-84; Gly-102 to Val- 
110, Pro 122 to Ala-131; and Glu-250 to Pro-258. 1 

»a 4 ^!i 8 o*^“ r “m'" 54 '' Ala ' 117 to Val - 125; Tht - 165 “ ®r- 

SP022 ^ 

I ' ys ‘ 81; TVr_8S “ Gly - 93 ' 

228: 4 Lr< 3 ^ 5 ”'' c ^ 8 ! 2 ”. i, "- S5 -' ^0-238 co 

SP025 

tiV^ 0 Thr_2 ° ; Gly " 2? C ° Ph6 - 33; Gly " 63 - and Ile-93 to 


SP 0 2 8 

Asp-171 to Pro-179; Tyr-340 to Glu-350; Pro-455 to Tvr , a 

to Pro-480. to Tyr-463; and Asp-474 

SP030 

lBO^anrA^^o'' TrP " 81 C ° Ala - 90; Phe-101 to Ala-106; Thr-124 to Tyr- 
±JU ? and Asn-138 to Glu-144. co iyr 

SP031 

Asp-8 to Val-16; Gly-27 to Thr-35; Gly-178 to Asn-19S- Thr- inn ► 

A.P208, Trp-218 to Lou-228, ,„d Lye-226 to I^p-241 ^ “ 

SP032 

Ser-9 to Asp-28; Phe-31 to Val-40; Gly-42 to Ara-50- c, _ r 

A.p-174 to Phe-386; Leu-324 to »et-333; LdX^'to *^ 34 ° “ ‘ 

SP033 

Gln-2 to Ile-13; Phe-46 to Ile-53; and Asp-104 to Thr-121. 

SP034 

to U Ze!-3 t 29 GlY " 43; Ala ' 188 C ° ASP ' 196; Tr P" 313 to Gly-320; and Leu-323 
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Table 2 

3. pneumoniae Antigenic Epitopes 


SP035 

Arg-19 to Asp-36; Asp-47 to Val-57; Asn-134 to Thr-143; Asp-187 to Arg- 
196; and Glu-222 to Ser-230. 

SP036 

Arg-10 to Arg-17; Lys-29 to Ser-39; Ser-140 to Ala-153; Arg-158 to Tyr- 
169; Asp-175 to Ala-183; Gly-216 to Asn-236; Ala-261 to Leu-270; Arg- 
282 to Phe-291; and Thr-297 to Ala-305; Pro-342 to Gln-362; Phe-455 to 
Asp-463; His-497 to Thr-511; Ala-521 to Gly-529; Ile-537 to Val-546; 
Ile-556 to Ala-568; Pro-581 to Ser-595; Glu-670 to Ala-685; Ser-696 to 
Ala-705 and Leu-782 to Ser-791. 

SP038 

Glu-61 to Pro-69; Phe-107 to Ala-115; Leu-130 to Tyr-141; Ala-229 to 
Glu-237 ; Ser-282 to Asn-287; Ala-330 to Glu-338; and Tyr-387 to Glu- 
393. 

SP03 9 

Ser-28 to Asp-35; Pro-88 to Pro-96; Leu-125 to Arg- 13 5 ; ' Phe-149 to Leu- 
157; Gln-246 to Val-254; Ala-357 to Thr-362; Gly-402 to Lys-411; and 
Leu-440 to Pro-448. 

/ 

SP040 

Thr-21 to Ile-30; His-54 to Gln-68; Arg-103 to Leu-117; and Thr-127 to 
Leu- 13 6 . 

SP 0 4 1 

Gly-36 to Asp-49; Leu-121 to Val-128; and Ala-186 to Ile-196. 

SP042 

Gly-11 to Arg- 19 ; Ile-23 to Lys-31; His-145 to Asn-151; Gln-159 to Asp- 
166; Ile-175 to Asp-181; Gly-213 to Tyr-225; Ile-283 to Val-291; Pro- 
329 to Glu-364 ; Arg-372 to Ser-386; Thr-421 to Phe-430; Leu-445 to Val- 
453; Ile-486 to Ala-497; Asp-524 to Ala-535; His-662 to Gly-674; and 
His-679 to Gln-702 . 

5 P 0 4 3 

Lys-2 to Asp-12; Val-58 to Asn-68; Ser-87 to Asp-95; and Asp-102 to 
Lys-117 . 

SP044 

Gln-3 to Lys-11; Asp-37 to Tyr-52; Glu-171 to Leu-191; His-234 to Asn- 
247; and Asn-283 to Ala-291. 

SP045 

Tyr-52 to Ile-63; Asp-212 to Gln-227; Ser-315 to Thr-332; Leu-345 to 
Phe-354; Asp-362 to Val-370; Thr-518 to Asn-539; Ala-545 to Lys-559; 
and Val-601 to Pro-610. 

SP046 

Gin- 9 to Ala-18; Glu-179 to Lys-186; Lys-264 to Glu-271; Gly-304 to 
Glu-17; Ser-503 to Asn-511; Asn-546 to Thr-553; and Asn-584 to Asp-591. 

SP048 
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Table 2 

S. pneumoniae Antigenic Epitopes 

InTlU-foTll 6 ^- 2 I 1 1 r 118; Arg - 131 to 

SP05 0 

^l-3°i5 G1U " 113i Gly ‘ 22 ° C ° Gly - 228; A - 284 - CIU-29S; Tbr-298 
SP05 1 

«*"pro-279? 1 iIl-368 ^"^ 386 ^ 1 ’ ‘ ^ Asp-265 

xzi: 1 « - «;^2r:r £-s;; 4 £ sss “ 

SP052 

Ala \ 117 CO Val “ 139 '- Ile-142 to 
Leu-328 to Val-351- Glv-358 to Th ,oq° Ph€ 283 ' Arg ~ 290 to Ser-299; 


t^"vf 1 -262y*s;“277™rGl 4 y-297. L T^-^i S/* 1 ’ 216 ' Cln - 22a 

409; Thr-414 to Ser-453 • L„ dfii , , Glyn-368; Thr-385 to Ala- 

Gly “630 to Arg-639 ,- and Asp-720 to^u^O G1U " 5?6 “ Thr - 625 '' 

SP054 

Glu-7 to Va 1-28; and Tyr-33 to Glu-44 . 


SPOSS 

Pro-3 to Val-18 ; Thr-21 to Lys-53 
172; and Val-204 to Ser-241. 


Val-34 to Lys-99 


Xle-162 to Val- 


SP05 6 

Val-34 to Tyr-41 

SP057 


Leu-47 to Glu-55 ; and Pro-57 to Gln-66. 


Asp-1 to Val-25 
Glu-172 . 


Pro-29 to Ile-80; Asn-96 to Val-145 


and Pro-150 to 


SP058 

Ala-64 to Thr-70; Leu-82 to His-138; and Val-228 to Asn-236 


SP05 9 

Val-10 to Thr-24 
Val-130; Thr - 1 8 6 


Ser 76 to Pro-102; Ser-109 to Ile-119 
to Ile-194; and Asn-234 to Ser-243. 


Ser-124 to 


SP060 

Leu-70 to Arg-76 ; and Val-79 to Ile-88. 


SP062 

Glu-14 to Lys -28; Ser-32 to Lys-46; and Glu-66 to Thr-74 
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Table 2 

S * pneumoniae Antigenic Epitopes 


SP063 

Ile-10 to Val-25; Val-30 to Thr-40; Asp-44 to Pro-54; Asn-57 to Val- 
63; Pro-71 to Val-100; and Thr-105 to Thr-116. 

SPO 64 

Pro-12 to Leu-32; Val-40 to Leu-63; Asp-95 to Ala-125; Ser-164 to 
Glu-184; Ser-314 to Glu-346; Asn-382 to Val-393; Leu-463 to Gln-498; 
Asn-534 to Lys-548; and Lvs-557 to Gly-605. 

SP065 

Asn-2 to Ile-12; Ala-39 to Thr-61; and His-135 to Ala-155. 

SPO 67 

Gly-1 to Thr-13; Asp-203 to Asn-213; and Gly-240 to Asp-253. 

SPO 68 

Ser-2 to 3er-12 ; Val-17 to Gln-26; and Lys-54 to Cys-67. 


SF069 

Ser-32 to Thr-41; Pro-66 to Glu-80; Thr-110 to Val-122; and Val-147 
to Thr- 180 . 

SP070 

Lys-6 to Tyr-16; Gln-19 to Ile-27; Arg J 50 to Ala-58; Leu-112 to Val- 
128; lie- 151 to Asn-167; Leu-305 to Phe-321 . 

SP071 

Gln-92 to Asn-158; Gln-171 to Gln-188; Val-204 to Val-240; Thr-247 to 
Ala-273; Glu-279 to Thr-338; Pro-345 to Glu-368; Asn-483 to Lys-539; 
Val-552 to Ala-568; Glu-575 to Ser-591; Ser-621 to Gly-640; Gln-742 
to Gly-758. 

SP072 

Val-68 to Tyr -81; Tyr-36 to Val-121; Leu-127 to Gly-140; Gly-144 to 
Ala-155; Gln-168 to Val-135; Asp-210 to Try-241; Glu-246 to Thr-269; 
Lys-275 to Tyr -29 5 ; Gly-303 to Pro-320; Arg-327 to Ile-335; Thr-338 
to Thr-364; Tyr -47 8 to Phe-495; and Tyr-499 to Arg-521. 

SP073 

Glu-37 to Val-45 ; Glu-55 to Val-63; Thr-104 to Thr-119; Ile-127 to 
Tyr -135; Asn-220 to Ile-232; Thr-237 to Ala-250; Ser-253 to Ala-263; 
Glu-284 to Ile-297; and Met-438 to Asn-455. 

SP074 

Gly-2 to Ala-12; Gly-96 to Ile-110; and Thr-220 to Phe-239 . 

SP075 

?he-33 to Tyr- 4 2 ; Gln-93 to Gly-102; and Val-196 to Asp-211, 


SP076 

Ser-64 to Leu-76; and ?he-81 to Ala-101. 

SP077 

Asp-1 to Glu-12; Tyr-26 to Val-36; and Val-51 to Try-62. 
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SP078 

Ala-193 co 
to Gly-422 
Gln-508 . 


5. 


Table 2 


PaeMoDiaa Antigenic Epitope* 


lie 208; Tyr-266 co Asn-275; 
Ser-437 co Pro-464; Thr-492 


Glu-356 co Leu-369; Ala-411 
to Glu-534; and Glu-571 Co 


SP079 

Gly-ll to Leu-20; Lys-39 
158; He-178 to Asp-187; 

SPO80 


to Leu-48; 
Tyr-189 to 


Leu-72 

Gln-201 


to Val-85; Asn-147 to Ser- 
; and Leu-203 to Ala-216 


Ser 2 to Glu-12; Gln-42 
Asp-143; and Ile-159 to 


to Ala-51 
Ile-171 . 


Ala-116 to Ser-127 


Phe-13 1 to 


SPOBl 

Gln-2 to 
145; and 


Leu-9; Gln-49 to Cys-57; 
Trp-154 to Cy s -162. 


He- 108 to Val-131 


Gly-134 to Leu- 


SP082 
Ile-101 co 
to Leu-255 


Ser-187; Gly-191 Co Asn-221 
and Gly-259 co Arg-268. 


Arg-225 to Arg-236 


Tyr-239 


SP083 

Ser-28 to Asp-713. 


SP084 

Leu-42 to Gin-6o; 
Asn-110 . 


Thr-69 to Lys -81 


Glu-83 to Arg-92 


and Gly-98 to 


SP08 5 

Gln-2 co Val-22; and Ser-45 co Glu-51. 


SPO 86 

ueu-18 co Gin- 6 5 ; and Lys-72 co Val-83 


SP087 

Ser-45 Co Leu-53; and Thr-55 Co Gln-63 


SP088 

Pro-8 co lie-16; Leu-2S to 
Val-59 to Arg-67; Thr-55 to 
Leu-115 ; 

Leu-llS co Trp-123; Ala-121 
Gly-184; Tyr-194 to Ala-202, 

SPQ89 


Trp-33; Tyr-35 to Gln-43 ; 
Tyr-63; Asn-85 to Gly-93 


Leu- 51 to Val-59; 
Thr-107 to 


to Thr-129; Tyr-153 
; Ala-217 to Gly-225 


to Ala-161; His- 176 to 
and Asn-85 to Gly-93. 


Phe-114-° ^ys-J^^r^ys-LsrT’p-Je 1 " 93 U° Gln - 101 '- Phe-106 to 
Ile-193 to Tyr-201 ; Lys-203 to Lys-211 • ^1 P -?i \ ^ ' Glu ~ 168 to Gln-176; 
Tyr-245; Lys-247 to Lys-255; Glu-256 ^ t0 Gln ' 220 '' He-237 to 

Lys-286 to Gly-294 * T^d ° 264; Met-275 to Gly-283 * 

S«-323, A.p-^34 \o C ° Thr - 2 ”-' - 

Lys 527 to Arg-535; Phe-537 to Met 545 ° h^ 379 ,! Arg ~ 485 Co Asn-493; 

ro Met-545; and Tyr-549 to Glu-557. 

SP090 
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S . pneumoniae Antigenic Epitopes 

Phe-2 to Gln-10; Gln-13 to Lys-21; Tyr-19 to Glu-27; Tyr-39 to Met-47; 
Pro-65 to Leu-73; Tyr-121 to His-129; Lys-147 to Ile-155; Gly-161 to 
Ly s -169; Gly-218 to Trp-226; Asp-230 to Thr-238; Tyr-249 to Ala-257 ; 
and Ala-272 to Gly-280. 

SP091 

Ser-19 to Ser-27; Asn-25 to Thr-33; Val-51 to Gln-59; Asn-75 to Asn-83; 
lie-103 to Trp-111; Tyr-113 to Ala-121; Leu-175 to Asn-183; Glu-185 to 
Trp-193; Ala-203 to Tyr-211; Val-250 to Phe-258; Asn-260 to Thr-268; 
Ser-278 to Asp-286; Tyr-305 to Leu-313; Asn-316 to Gly-324; Asn-374 to 
Asp-382; Asn-441 to Gly-449; and Ser-454 to Gln-462 . 

SP092 

Arg-95 to Glu-103; Ala-216 to Val-224; Leu-338 to Glu-346; Pro-350 to 
Ala-358; Pro-359 to Ala-367; ?ro-368 to Ala-376; Pro-377 to Ala-385; 
Pro-386 to Ala-394; Pro-395 to Ala-403; Pro-350 to Ala-358; Gin-414 to 
Lys-422; Pro-421 to Asn-429; Trp-465 to Ty r -473; Phe-487 to Tyr-495; 
Asn-517 to Gly-525; Trp-536 to Tyr-594; Phe-608 to Tyr-615; and Asp-630 
to Gly-63 8 - 

SP093 

Gln-30 to Ile-38; Gln-52 to Val-60; Ala-108 to His-116; Tyr-133 to 
Glu-141; Tyr-19,2 to Ala-200; and ?he-207 to Ser-215. 

SP094 

Ala-87 to Val-95; Leu-110 to Cys-118; Gln-133 to Leu-141; Ser-135 to 
Leu-193; Ile-195 to Gly-203; Asp-206 to Gln-214; Ser-211 to Gly-219; 
Ile-241 to Thr-249 . 

SP095 

Arg-1 to Gln-9; Phe-7 to Asn-15; Thr-21 to Asn-30; Leu-46 to Phe-54; 
and Ser-72 to Met-80. 

SP096 

Gly-29 to 1 1 e - 3 7 ; Glu-52 to Ser-60; and Leu-64 to Gly-72 . 

SP097 

Ala-11 to Thr-19; Glu-53 to Glu-61; Ser-91 to Lys-99; Thr-123 to 
Gln-131; and Gly-209 to Lys-217, 

SP09 8 

Thr-3 to Ser-11; Gly-38 to ?he-46; Tyr-175 to Asn-183; Met-187 to 
Cys-195; Gln-197 to Leu-205; Tyr-307 to Gln-315; Gly-318 to Tyr-326; 
Asn-348 to Val-356; Lys-377 to Pro-385 ; and Leu-415 to Val-423 . 

SPO 9 9 

Arg-19 to Gly-27; Asp-76 to Ser-84; Val-90 to Lys-98; Phe-165 to 
Val-173; Leu-237 to Pro-245. 

SP100 

His-111 to Gln-119; Ser-141 to His-149; Asp-154 to Ser-162; Gln-158 
to Gin- 166 ; Asp-154 to Gln-166; Lys-180 to Gln-188; and Ser-206 to 
Gln-214. 

SP101 
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s. pneumoniae Antigenic Epitopes 

Ile-69 ; t Leu-82 co Ile-90 • ^-^08 ^ ' G r ln " 5 ° C ° Ser_53; Thr ' 61 to 
and Leu-130 Co Thr-138. ' ° Leu-116; Gln-121 to Pro-129, 


SP 10 2 

Asp-32 to His-40; Arg-48 Co Lys-56; and Asp-102 Co Thr-110. 

Glnll75? Pro-189 ' to^Glu-iS^ ■ ^ Gln ' 159; Ar 9-167 to 

Ser-233 to Ser-241- Pro-255rn a Leu 215; Ser ~ 219 to Gln-22 

tO r _Val-326; Thr-341~ to° Asp-3$1; ^Asn-356 T'”* — 


Thr-364; Val-370 Co 


He-379 to Gln-387; and Mec-435 co 


Tvr-443 . 


SP105 

Asn-28 to Pro-36; Thr-77 to 
Phe-llS; Asp-169 to Asp-177 
Ala-282. 


?he-35 ; Arg-88 to Val-96; Gly-107 co 
His-248 to Ser-256; and Ser-274 to 


SP106 

Val-10 to Thr -A 8 ; 
Gln-94; Lys-100 to 
Arg-153.. 


Ile-62 to Tyr -70; Ile-71 to 
Thr -108; Phe-132 to Leu-140 


Pro-79; Lys-86 to 
and Asp-145 to 


SP107 

Asp-33 to Val-41; and Arg-63 to Gln-71. 


SP108 


Lys-9 to 
Ser-117; 
Ala-323 


Gln-17; Leu-44 to Ser-52; 
Ile-183 to ile-191; Pro-194 
to Thr-331; and Leu-381 to ■ 


Ser-63 to Phe-71; Tyr-109 to 

to Leu-202; Gly-257 to Gln-265- 
Tyr- 3 89 . 


SP109 


Asn-2 to 
Asp-119 ; 
Asp-196 
Tyr-278 ; 
Gly-335. 


tin'll 6 f la 'l 5 CO LyS * 73; Leu ~ 76 to Glu-84; Thr-111 to 
Val-291 to Hrs-299; Asp-306 to Leu-314; and 8^-327 io 


to 


SP110 

Ser-8 to Glu-16 
to Thr-13 0 . 


lie 37 to Val-45; Ala-107 to Val-115 


and Gly-122 


SPlll 

Asp-19 to Glu-28 ; Leu-43 to 
Ser-141; Phe-162 to Asd- 170 
Ser-281 . 


Ala-51; Asn-102 to 
Tyr-194 to Met-202 


Phe-110; Gln-133 to 
and Asp-273 to 


NSDOCID: <WO 9818930A2J. 



WO 98/18930 


PCT/US97/19422 


104 

Table 2 

S . pneumoniae Antigenic Epitopes 


SP112 

Asp-3 to Gln-11; Gly-21 to Ile-29; Ala-46 to Arg-54; Arg-98 to 
Arg-106; Thr-114 to Val-122; Gln-133 to Asn-141; and Leu-223 to 
Thr-231 . 

SP113 

Asn-19 to Gly-27; Arg-54 to Ser-62; Val-69 to Gln-77; Ser-117 to 
Asn-125; Gly-164 to Leu-172; Tyr-193 to Ser-201; Cys-303 to Phe-311; 
His-315 to Ile-323; Arg-341 to Cys-349; Ile-347 to Ser-355; Arg-403 
to Phe-411; Gln-484 to Pro-492; Ser-499 to Leu-507; Ile-541 to 
Thr-549 

Asn-622 to Ile-630; and Glu-645 to Gly-653 . 

SP114 

Gly-17 to Leu-25; His-40 to Gln-48; Arg-49 to Arg-57; Ile-65 to 
Pro-73 ; 

Asn-101 to Asp-111; Gly-128 to Cys-136; Phe-183 to Thr-191; and 
Pro-268 to lie-276 . 

SP115 

Met-8 to Ser-16; Tyr-24 to Leu-32; Cys-68 to Leu-76; Ser-100 to 
Pro-108; Thr-193 to Thr-201; Gly-238 to Pro-250; Thr-280 to Phe-288; 
Pro-303 to Asiv-312; Trp-319 to Leu-328; Leu-335 to Leu-344; Lys-395 
to Ala-403; Asn-416 to Gln-424; Tyr-430 to Ser-438; Val-448 to 
Leu-456; Leu-460 to Thr-468; Pro-502 to Thr-510; Lys-515 to 

Ile-524; Gln-523 to His-532; Tyr-535 to Thr-543; Ser-559 to 

Pro-567; Thr-572 to Asn-580; 

Val-594 to Arg-602; Arg-603 to Asn-611; Thr-620 to Trp-628; and 
Tyr-644 to Arg-653 . 

SP117 

Ala-6 to Gly-14; Ile-19 to Thr-27; Thr-99 to Leu-107; Ser-117 to 
Asp- 12 5; His -131 to Val-139; Ile-193 to Gly-201; and Val-241 to 
Gln-249 . 

SP118 

Ser-8 to Trp-23; His-46 to Ala-54; Asn-93 to Gly-101; Val-100 to 
Ser-108; Arg-155 to Asp-163; and His-192 to Leu-200. 

SP119 

Tyr-46 to Lys-54; Ser-93 to Ser-101; Trp-108 to Asn-116; Val-121 to 
Glu-129; and Tyr-131 to Gln-139. 

SP120 

Ala-57 to Lys-65; Leu-68 to Glu-76; Thr-103 to Tyr-116; Tyr-122 to 
Val-130 ; His -163 to Gly-173; Asp-188 to Ser-196; Ser-222 to Ser-231; 
Phe-244 to Ser-252; Pro-262 to Tyr-270; Val-283 to His-291; and 
Asp-298 to Leu-306. 

SP121 

Ser-3 to Ala-11; Asp-13 to Leu-21; Ser-36 to Val-44; and Gln-136 to 
Met-144 . 

SP122 

Asn-28 to Lys-36; Glu-39 to Thr-50; Val-54 to Lys-62; Asn-106 to 
Leu-114; Phe-159 to Gly-167; Asn-172 to Arg-180; Glu-199 to Asn-207; 
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Table 2 

pneumoniae Antigenic Epitopes 

Lys-230 to His-241- Acn-X') _ 

to ASP-3S4; Lys-362 to -Thr-370- 393 “ et ‘f 8 C ° Ala-287; Thr-346 

Ala-424; Gly-434 to Gly -443 • IVr 484 Asn " 405 ' Asp-411 to 

Lys -970; Leu-977 to aL 985 iT”® C ° Pr °- 951 '’ Gly-960 

0 Asp-y85; and Leu-988 to 

SP123 
Val-4 to 


Pro-996 . 


Asn- 12 ; Glu -47 to 


Thr-173; Lys-234 ^r\ai-242; L Ser-258 L tr^ t !,« 1U ' 1 °° ; s ^-^5 to 
Tyr ~^ 2? t0 Leu ' 335 -' Tyr-457 to Thr-465- Tvr~ 49 / G1U 284 t0 Asn " 29 2; 

ST 14 - LyS - 517 ~ --^“ 5 ^ " 5 ^ 


to 

Glu-564 . 


SP124 

Tyr - 77 -- •=„ 

r”"i 81 ” Thr-210'to AU-J1S- A<i " _164 « Clu-172, 

s».ns; M7 ’ s "- 312 “ "«-»>■• t r 287 

Mn-60S “ j.sB-S! A *"" 501 to * s n- 509 ; Cln-536 to Ala- 547 , and 


O Asp-21; Ala-28 to 


to Asp-229; 
-321; Asn-324 
Met-402 to 
to 


SP 12 5 

Ser-9 c 
Arg-145 
Pro -221 
to Glu 
Lys-370; 

Gly-529 

SP12 6 
Leu-22 t 

Thr-129; T^ljrti^pro-ue-^r 3 ^^^ 1 ' 1 ?! ^ Thr ' 113; L V S -121 to 

Leu-194 to Asn- 202 : . 152 _. to Ile-160; Lys-180 to Asp-188; 


Asn-155 to Le U -163; L GL-W3''‘o" 4 Lp!l9l''*G! y : 2 ^ 1 ; !37 “ 

a.p-s37, sr-; 5 ^-r« 2 " 


CO Glu-483 , 
Lys -587 , 


Thr-30; Val-65 to Leu-73; and Thr-75 


-o Asp-83 . 


Asn-202 ; and Gly- 2 M~ LyS - 18 ° to 
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Primer 

Name 

SEO ID 

SP001A 

NO: 227 

SP001B 

NO : 228 

SP004A 

NO : 229 

SP004B 

NO : 2 3 0 

SP006A 

NO ; 23 1 

SP006B 

NO : 232 

SP007A 

NO : 233 

SP007B 

NO : 2 34 

SP008A 

NO : 2 3 5 

SP008B 

NO : 2 3 6 

SP009A 

NO: 237 

SP009B 

NO : 2 3 8 

SP010A 

NO: 2 39 

SP010B 

NO: 240 

SPG11A 

NO : 241 

SP011B 

NO: 242 

SP012A 

NO: 243 

SP012B 

NO: 244 

SP013A 

NO : 2 4 5 

SP013B 

'NO': 2 46 

SP014A 

NO : 247 

SP014B 

NO : 248 / 

SP015A 

NO : 249 

SP015B 

NO: 250 

SP016A 

NO : 251 

SP016B 

NO : 252 

SP017A 

NO : 2 5 3 

SP017B 

NO: 254 

SP019A 

NO : 2 5 5 

SP019B 

NO : 2 5 6 

SP020A 

NO: 257 

SP020B 

NO: 258 

SP021A 

NO : 2 5 9 

SP021B 

NO : 2 6 0 

SP022A 

NO : 2 6 1 

SP022B 

NO : 2 6 2 

SP023A 

NO : 2 6 3 

SP023B 

NO : 2 64 

SP025A 

NO : 2 65 

SP02SB 

NO : 2 6 6 

SP028A 

NO : 2 6 7 

SP028B 

NO : 2 6 8 

SP030A 

NO : 2 6 9 

SP030B 

NO : 270 

SP031A 

NO: 271 

SP031B 

NO: 272 

SP032A 

NO : 273 

SP032B 

NO : 2 7 4 

SP033A 

NO : 275 

SP033B 

NO: 27 6 

SP034A 

NO: 277 

SP034B 

NO : 27 8 

SP035A 

NO: 27 9 

SP035B 

NO: 280 

SP036A 

NO : 2 81 


Table 3 

S. pneumoniae ORF Cloning Primers 
Sequence 

GACTGGATCCTAAAATCTACGACAATAAAAATC 

CTGAGTCGACTGGTTGTGCTGGTTGAG 

GTC AGG ATCC AAATTAC AATACGG AC TATG 

CAGTGTCGACTAACTCTAGGTCGGAAAC 

GACTGGATCCTGAGAATCAAGCTACACCCAAAGAG 

AGTCAAGCTTTTGTAACTGAGATTGATCTGG 

GACTGGATCCTGGTAACCGCTCTTCTCGTAACGCAGC 

AGTCAAGCTTTTTCAGGAACTTTTACGCTTCC 

AGTCAGATCTTGTGGAAATTTGACAGGTAACAGCAAAAAAGCTGC 

ACTGAAGCTTTTTTGTTTTTCAAGAATTCATCG 

GACTGGATCCTGGTCAAGGAACTGCTTCTAAAGAC 

AGTCAAGCTTTCACAAATTCGTTGGTGAAGCC 

GACTGGATCCTAGGTCAGGTGGAAACGCTGGTTCATCC 

AGTCAAGCTTATCAACTTTTCCACCTTCAACAACC 

GTCAAGATCTCTCCAACTATGGTAAATCTGCGGATGG 

AGTCCTGCAGATCCACATCCGCTTTCATCGGGTTAAAGAAGG 

GACTGGATCCTGGGAAAAATTCTAGCGAAACTAGTGG 

GTCACTGCAGCTGTCCTTCTTTTACTTCTTTGGTTGC 

GACTGGATCCTGCTAGCGGAAAAAAAGATACAACTTCTGG 

CTGAAAGCTTTTTTGCCAATCCTTCAGCAATCTTGTC 

GACTAGATCTTGGCTCAAAAAATACAGCTTCAAGTCC 

AGTCCTGCAGGTTTTTGTTTGCTTGGTATTGGTCG 

GACTGGATCCTAGTACAAACTCAAGCACTAGTCAGACAGAG 

CAGTCTGCAGTTTCAAAGCTTTTTGTATGTCTTC 

GACTGGATCCTGGCAATTCTGGCGGAAGTAAAGATGC 

AGTC AAG C TTGTTTC AT AGC TTTTTTG ATTG TTT C G 

GACTGGATCCTTCACAAGAAAAAACAAAAAATGAAGATGG 

AGTCAAGCTTATCGACGTAGTCTCCGCCTTC 

GACTGGATCCGAAAGGTCTGTGGTCAAATAATCTTACC 

AGTC AAGC TT AG AGTT AAC ATGGTGC TTGC C AAT AGG 

GACTGGATCCAAACTCAGAAAAGAAAGCAGACAATGC 

AGTC AAGC TTCC AAAC TGGTTG ATCC AAACC A TC TG 

GACTGG ATC C TTCG AAAGGGTC AGAAGGTGC AGAC C 

AGTCAAGCTTCTGTAGGCTTGGTGTGCCCCAGTTGC 

CTGAGGATCCGGGGATGGCAGCTTTTAAAAATC 

CAGTAAGCTTGTTTACCCATTCACCATTACC 

CAGTGGATCCAGACGAGCAAAAAATTAAG 

TCAGAAGCTTGTTTAC CC ATTCACCATT 

GACTGGATC CCTGTGGTGAGGAAGAAACTAAAAAG 

CTGAGTCGACAATATTCTGTAGGAATGCTTCGAATTTG 

CTGAGGATCCGACTTTTAACAATAAAACTATTGAAGAG 

GTCACTGCAGGTTGTCACCTCCAAAAATCACGG 

GACTGGATCCCTTTACAGGTAAACAACTACAAGTCGG 

CAGTAAGCTTTTCGAAGTTTGGCTCAGAATTG 

GACTGGATCCCCAGGCTGATACAAGTATCGCA 

CAGTAAGCTTATCTGCAGTATGGCTAGATGG 

GACTGGATCCGTCTGTATCATTTGAAAACAAAGAAAC 

CAGTCTGCAGTTTTACTGTTGCTGTGCTTGTG 

ACTGAGATCTTGGTCAAAAGGAAAGTCAGACAGGAAAGG 

CAGTAAGCTTATTCCTGAGCTTTTTTGATAAAGGTTGCGCA 

ACTGGGATCCGAAGGATAGATATATTTTAGCATTTGAGAC 

AGTCAAGCTTCCATGGTATCAAAGGCAAGACTTGG 

GTCAGGATCCGGTAGTTAAAGTTGGTATTAACGG 

AGTCAAGCTTGCAATTTTTGCGAAGTATTCCAAGAG 

AGTCGGATCCTTCTTACGAGTTGGGACTGTATCAAGC 


RE 

Sam HI 
Sal I 
Sam HI 
Sal I 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bgl II 
Hind III 
Bam HI 
Hind III 
Sam HI 
Hind III 
Bgl II 
Pst I 
Sam HI 
Pst I 
Bam HI 
Hind III 
Bgl II 
Pst I 
Bam HI 
Pst I 
Bam HI 
Hind III 
Sam HI 
Hind III 
Sam HI 
Hind III 
Sam HI 
Hind III 
Sam HI 
Hind III 
Sam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Sal 1 
Sam HI 
Pst I 
Sam HI 
Hind III 
Bam HI 
Hind III 
Sam HI 
Pst I 
Bgl II 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Sam HI 
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Primer 

Name 

SEO ID 

SP036B 

NO -.282 

SP038A 

NO :283 

SP038B 

NO: 284 

SP039A 

NO: 285 

SP039B 

NO : 286 

SP040A 

NO:287 

SP040B 

NO: 288 

SP041A 

NO: 289 

SP041B 

NO : 2 9 0 

SP042A 

NO : 291 

SP042B 

NO : 2 9 2 

SP043A 

NO: 293 

SP043B 

NO : 2 9 4 

SP044A 

NO: 295 

SP044B 

NO : 2 9 6 

SP045A 

NO: 2 97 

SP045B 

NO : 2 9 8 

SP046A 

NO : 2 9 9 

SP046B 

NO: 300 

SP048A 

NO : 3 0 1 

SP048B 

NO: 3 02 

SP049A 

NO: 303 / 

SP049B 

NO: 304 

SP050A 

NO: 305 

SP050B 

NO: 306 

SP051A 

NO: 307 

SP051B 

NO: 308 

SP052A 

NO : 3 0 9 

SP052B 

NO: 310 

SP053A 

NO: 3 11 

SP053B 

NO: 3 12 

SP054A 

NO: 3 13 

SP054B 

NO : 3 1 4 

SP055A 

NO : 3 1 5 

SP055B 

NO: 316 

SP056A 

NO : 3 1 7 

SP056B 

NO : 3 1 8 

SP057A 

NO: 319 

SP057B 

NO : 3 2 0 

SP058A 

NO : 3 2 1 

SP058B 

NO: 322 

SP059A 

NO : 3 2 3 

SP059B 

NO : 3 2 4 

SP060A 

NO : 3 2 5 

SP060B 

NO : 3 2 6 

SP062A 

NO: 327 

SP062B 

NO: 328 

SP063A 

NO: 329 

SP063B 

NO: 330 

SP064A 

NO : 3 3 1 

SP064B 

NO : 3 3 2 

SP065A 

NO : 3 3 3 

SP065B 

NO : 3 3 4 

SP067A 

NO : 3 3 5 

SP067B 

NO : 3 3 6 


Table 3 

S . pneumoniae ORF Cloning Primers 
Sequence 

AGTCAAGCTTGTTTATTTTTTCCTTACTTACAGATGAAGG 

AGTCGGATCCTACTGAGATGCATCATAATCTAGGAGC 

TCAGCTCGAGTTCTTTGACATCTCCATCATAAGTCGC 

GACTGGATCCGGTTTTGAGAAAGTATTTGCAGGGG 

CAGTAAGCTTGGATTTTTTCATGGATGCAATTTTTTTGG 

GACTGGATCCGACAACATTTACTATCCATACAGTAGAGTCAGC 

GACTAAGCTTGGCATAAGGTTGCAATTCTGGATTAATTGG 

GACTGGATCCGGCTAAGGAAAGAGTGGATG 

GACTAAGCTTTTCATTTTTAAATTGACTATGCGCCCG 

GACTGGATCCTTGTTCCTATGAACTTGGTCGTCACC 

CATGAAGCTTATCCTGGATTTTTCCAAGTAAATCT 

GACTGGATCCTTATAAGGGTGAATTAGAAAAAGG 

GACTAAGCTTCTTATTAGGATTGTTAGTAGTTG 

GACTGGATCCGAATGTTCAGGCTCAAGAAAGTTCAGG 

GACTAAGCTTTTCCCCTGATGGAGCAAAGTAATACC 

GACTGGATCCCTTGGGTGTAACCCATATCCAGCTCCTTCC 

G AC TG TC G ACTTC AGC TTGTTT ATC TGGGGTTGC 

GACTGGATCCTAGTGATGGTACTTGGCAAGGAAAACAG 

ACTGCTGCAGATCTTTGCCACCTAGCTTCTCATTG 

GTCAGGATCCTGGGATTCAATATGTCAGAGATGATACTAG 

CTAGAAGCTTACGCACCCATTCACCATTATCATTG 

GTCAGGATCCGGATAATAGAGAAGCATTAAAAACC 

AGTCAAGCTTGACAAAATCTTGAAACTCCTCTGGTC 

GTCAGGATCCAGATTTTGTCGAGGAGTGTCATACC 

AGTCAAGCTTTCCCTTTTTACCCTTACGAATCCAGG 

GACTGGATCCATCTGTAGTTTATGCGGATGAAACACTTATTAC 

GACTGTCGACGCTTTGGTAGAGATAGAAGTCATG 

GACTGGATCCTTACTTTGGTATCGTAGATACAGCCGGC 

AGTCAAGCTTTGTTAATTGCGTACCTTCTAAGCGACC 

GACTGGATCCAGCTAAGGTTGCATGGGATGCGATTCG 

GACTGTCGACCTGGGCTTTATTAGTTTGACTAGC 

C AGTGGATCC CTATC AC TATGTAAATAAAGAGA 

ACTGAAGCTTTTCTGTCCCTGTTTGAGGCA 

CAGTGGATCCTGAGACTCCTCAATCAATAACAAA 

ACGTAAGCTTATAATCAGTAGGAGAAACTGAACT 

CAGTGGATCCGGATGCTCAAGAAACTGCGG 

GACTAAGCTTTTGCCTCTCATTCTTGCTTCC 

C AG TGGATC CCGAC AAAGGTGAGACTGAG 

ACGTAAGCTTATTTCTTAATTCAAGTGTTTTCTCTG 

GACTGGATCCAAATCAATTGGTAGCACAAGATCC 

CAGTGTCGACATTAGGAGCCACTGGTCTC 

CAGTGGATCCCAAACAGTCAGCTTCAGGAAC 

GACTCTGCAGTTTAATCTTGTCCCAGGTGG 

GACTGGATCCATTCGATGATGCGGATGAAAAG 

GACTAAGCTTCATTTGTCTTTGGGTATTTCGCA 

CAGTGGATCCGGAGAGTCGATCAAAAGTAG 

GTC AC TGC AGTTGCTCGTCTCG AGGTTC 

CAGTGGATCCATGGACAACAGGAAACTGGGAC 

CAGTAAGCTTATTAGCTTCTGTACCTGTGTTTG 

GACTGGATCCCGATGGGCTCAATCCAACCCCAGGTCAAGTC 

GACTCTGCAGCATAGCTTTATCCTCTGACATCATCGTATC 

GACTGGATCCTTCCAATCAAAAACAGGCAGATGG 

GACTAAGCTTGAGTCCCATAGTCCAAGGCA 

AGTCGGATCCT ATC AC AGG ATCG AACGGT AAG AC AAC C 

ACTGGTCGACTTCTTTTAACTCCGCTACTGTGTC 


BNSDOCIO-. <WO 981 893QA2J __ > 


RE 

Hind III 
Bam HI 
Xho I 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Sal I 
Bam HI 
Pst I 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Sal I 
Bam HI 
Hind III 
Bam HI 
Sal I 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Sal I 
Bam HI 
Pst I 
Bam HI 
Hind III 
Bam HI 
Pst I 
Bam HI 
Hind III 
Bam HI 
Pst I 
Bam HI 
Hind III 
Bam HI 
Sal I 
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Table 3 

S . pneumoniae ORF Cloning Primers 

Primer 

Name SEP IP Sequence 

SP068A NO: 337 CAGTGGATCCAAGTTCATCGAAGATGGTTGGGAAGTCC 

SP068B NO : 3 3 8 GATCGTCGACCCGCTCCCACATGCTCAACCTT 

SP069A NO : 3 3 9 TG ACGGATCC ATCGCTAGCTAGTG AAATGC AAG AAAG 

SP069B NO: 340 TGACAAGCTTATTCGTTTTTGAACTAGTTGCTTTCGT 

SP070A NO : 341 GACTGGATCCGCACCAGATGGGGCAC AAGGTTCAGGG 

SP07 OB NO : 3 42 TGACAAGCTTAACTTGTAACGAACAGTTCAATCTG 

SP071A NO: 343 GACTAGATCTTTTTAACCCAACTGTTGGTACTTTCC 

SP071B NO : 3 44 TGAC AAGCTTGTTAGGTGTTACATTTTGACCGTC 

SP072A NO : 3 4 5 ACTGAGATCTTTTTAACCCAACTGTTGGTACTTTC 

SP072B NO : 346 GACTAAGCTTTCTACGATAACGATCATTTTCTTTACC 

SP073A NO : 347 GACTGTCGACTCGTAGATATTTAAGTCTAAGTGAAGCG 

SP073B NO : 348 AGTCAAGCTTGTTAGGTGTTACATTTTGCAAGTC 

SP074A NO: 349 GACTGGATCCCTTTGGTTTTGAAGGAAGTAAG 

SP074B NO: 350 TG ACCTGC AG ACG ATTTTTG AAAAATGG AGG TG T AT C 

SP075A NO : 3 5 1 CAGTGGATCCCTACTACCTCTCGAGAGAAAG 

SP075B NO : 3 52 ACTGAAGCTTTTCGCTTTTTACTCGTTTGACA 

SP07 6A NO : 3 53 CAGTGGATCCTAAGGTCAAAAGTCAGACCGCTAAGAAAGTGC 

SP076B NO: 354 C AGTAAGCTTTAGGGTATCC AAATACTGGTTGTTG ATG 

SP077A NO : 3 5 5 TGACAGATCTTGACGGGTCTCAGGATCAGACTCAGG 

SP077B NO: 356 TGACAAGCTTCAAAGACATCCACCTCTTGACCTTTG 

SP07 8A NO : 3 57 GACTGGATCCTAGAGGCTTTGCCAAATGGTGGGAAGGG 

SP078B NO : 358 / GTC AGTCG ACTTGTTGTAAC ACTTTTCG AGGTTTGGTACC 

SP079A NO: 359 C AGTGGATCCTC AAAAAGAGAAGGAAAAC TTGG 

SP079B NO : 3 60 CAGTCTGCAGTTTCTTCAACAAACCTTGTTCTTG 

SP080A NO: 361 CAGTGGATCCACGTTCTATTGAGGACCACTT 

SP080B NO: 362 CAGTAAGCTTTTCCTTCTCAGTCAATTCTTTTCC 

SP0 8 1A NO : 3 6 3 GACTGGATCCCGCTCAAAATACCAGAGGTGTTCAG 

SP081B NO: 364 GACTAAGCTTAGTACCATGGGTGTGACAGGTTTGAA 

SP082A NO : 3 6 5 C TGAGG ATC C AATTGT AC AATT AG AAAAAG AT AGC 

SP082B NO: 366 TGACAAGCTTGCGTTGACTAGGTTCTGCAATGCC 

SP0 83A NO : 3 67 G ACTGG ATC C TC TG AC C AAGC AAAAAG AAG C AGT C AATG A 

3P083B NO : 3 6 8 TCAGCAGCTGATCATTGACTTTACGATTTGCTCC 

SP084A NO : 3 6 9 GACTGGATCCGTCCGGCTCTGTCCAGTCCACTTTTTCAGCG 

SP084B NO : 370 TC AGAAGC TTATTTTTTGTTTC CTTAATGC GTT 

SP085A NO : 3 7 1 GACTGGATCCGGGACAAATTCAAAAAAATAGGC AAGAGG 

SP085B NO : 372 GTCAAAGCTTTGGCTCTTTGATTGCCAACAACTG 

SP086A NO : 373 GACTGGATCCTCGCTACCAGCAACAAAGCGAGCAAAAGG 

SP086B NO : 3 7 4 GACTAAGCTTACTTTTTTCTTTTTCCACACGA 

SP087A NO: 375 - CAGTGGATCCGAACCGACAAGTCGCCCACTATCAAGACT 

SP087B NO: 376 CTGAAAGCTTTGAATTCTCTTTCTTTTC AGGCT 

SP088A NO : 377 TCG AGGATCCGGTTGTCGGCTGGC AATATATCCCGT 

SP088B NO : 3 7 8 C AGTAAGCTTCCG AACCCATTCGCCATT ATAGTTGAC 

SP0 8 9 A NO : 3 7 9 AGTCGGATCCGGCCAAATCAGAATGGGTAGAAGAC 

SP089B NO : 3 80 TGACCTGCAGCTTCTCATTGATTTTCATCATCAC 

SP090A NO : 3 81 GACTGGATCCATTTGCAGATGATTCTGAAGGATGG 

SP090B NO: 382 TGAGCTGCAGCTTAACCCATTCACCATTCTAGTTTAAG 

SP091A NO : 3 8 3 GACTGGATCCTGTCGCTGCAAATGAAACTGAAGTAGC 

SP091B NO : 3 8 4 GACTAAGCTTATACCAAACGCTGACATCTACGCG 

SP092A NO : 3 8 5 AGTCAGATCTTACGTCTCAGCCTACTTTTGTAAGAGC 

SP992B NO: 386 GACTAAGCTTAACCCATTCACCATTGGCATTGAC 

SP093A NO: 387 C AGTGGATCCTGG AC AGGTG AAAGGTC ATGCT AC ATTTGTG 

SP093B NO : 3 8 8 GACTAAGCTTCAACCATTGAGACCTTGCAACAC 

SP094A NO : 3 8 9 GTCAGGATCCGATTGCTCCTTTGAAGGATTTGAGAGAAACC 

SP094B NO: 390 GACTAAGCTTCGATCAAAGATAAGATAAATATATATAAAGT 

SP095A NO : 391 GACTGGATCCTAGGTCAT ATGGGACTTTTTTTCT AC AAC AAAATAGG 


HE 

Bam HI 
Sal I 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bgi II 
Hind III 
Bgl II 
Hind III 
Sal I 
Hind III 
Bam HI 
Pst I 
3am HI 
Hind III 
Bam HI 
Hind III 
Bgl II 
Hind 111 
Bam HI 
Sal I 
Bam HI 
Pst I 
3am HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Bgl II 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Sam HI 
Hind III 
Bam HI 
Pst I 
Sam HI 
Pst I 
Sam HI 
Hind III 
Bgl II 
Hind III 
Sam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
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Table 3 

S, pneumoniae ORF Cloning Primers 

Primer 

Name SEP ID Sequence 

SP095B NO; 392 TGACAAGCTTATCTATCAGCTCATTTAATCGTTTTTG 
SP096A NO : 393 CTGAGGATCCCAACGTTGAGAATTATTTGCGAATG 
SP096B NO ; 394 TGACAAGCTTGAGTCTACAAAAGTAATGTAC 
SP097A NO : 395 GTCAGGATCCCTACTATCAATCAAGTTCTTCAGCC 
SP097B NO ; 396 TGACAAGCTTGACTGAGGCTTGGACCAGATTGAAAAG 
SP098A NO : 397 GACTGGATCCGAC AAAAAC ATTAAAAC GTC CTGAGG 
SP098B NO : 3 9 8 GACTAAGCTTAGCACGAACTGTGACGCTGGTTCC 
SP099A NO : 399 GACTGGATCCTTCTCAGGAGACCTTTAAAAATATC 
SP099B NO : 400 GACTAAGCTTGTTGGCCATCTTGTACATACC 
SP100A NO : 401 GACTGGATCCAGTAAATGCGCAATCAAATTC 
SP100B NO : 402 AGTCCTGCAGGTATTTAGCCCAATAATCTATAAAGCT 
SP1Q1A NO : 403 CAGTGGATCCTTACCGCGTTCATCAAGATGTC 
SP101B NO : 404 GACTAAGCTTGCC AGATGTTGAAAAGAGAGTG 
SP102A NO : 4 0 5 GACTGGATCCGTGGATGGGCTTTAACTATCTTCGTATTCG 
SP102B NO ; 406 AGTCAAGCTTGCTAGTCTTCACTTTCCCTTTCC 
SP103A NO: 407 GACTGTCGACACTAAACCAGCATCGTTCGCAGG A 
SP103B NO : 4 0 8 CTGACTGC AGCTTCTTGAAGAAATAATGATTGTGG 
SP105A NO : 4 0 9 CAGTGGATCCTGACTACCTTGAAATCCCACTT 
SP105B NO : 4 1 0 C AGTAAGCTTTTTTTTAAGGTTGT AGAATGATTTC AATC 
SP106A NO: 411 CAGTGTCGACTCGTATCTTTTTTTGGAGCAATGTT 
SP106B NO: 412 GACTAAGCTTAAATGTTCCGATACGGGTGATTG 
SP107A NO : 4 1 3 / CAGTGGATCCGGACTCTCTCAAAGATGTGAAAG 
SP107B NO : 4 1 4 GACTAAGCTTCTTGAGTTTGTCAAGGATTGCTTT 
SP108A NO: 415 CAGTGGAfCCCAAGAAATCCTATCATCTCTTCCAGAAG 
SP108B NO : 4 1 6 GACTAAGCTTTTCAGAACTAAAAGCCGCAGCTT 
SP109A NO: 417 GACTGGATCC ACGAAATGCAGGGC AGAC AG 
SP109B NO : 4 1 8 CAGTAAGCTTATC AACATAATCT AGTAAATAAGCGT 
SP110A NO : 4 1 9 CAGTGGATCCTGTATAGTTTTTAGCGCTTGTTCTTC 
SP110B NO: 420 GTCAAAGCTTTGATAGAGTGTCATAATCTTCTTTAG 
SP111A NO; 421 GACTGGATCCGTGTGTCGAGCATATTCTGAAG 
SP111B NO: 422 CAGTAAGCTTACTTTTACCATTTCTTTGTTCTGCATC 
SP112A NO : 4 2 3 GACTGTCGACGTGTTTGGATAGCATTCAGAATCAGACG 
SP112B NO : 42 4 CAGTAAGCTTCGGAAGTAAAGACAATTTTTCC 
SP113A NO : 425 CAGTGGATCCGTGCCTAGATAGTATTATTACTCAAAC 
SP113B NO : 426 GACTAAGCTTTTTGCTTATTTCTCTCAATTTTTC 
SP114A NO : 427 CAGTGGATCCCATTCAGAAGCAGACCTATCAAAATC 
SP114B NO: 428 ACTGAAGCTTATGTAATTTTTTAGATTTTTCAATATTTTTC AG 
SP115A NO : 42 9 AGTCGGATCCTAAGGCTGATAATCGTGTTCAAATG 
SP115B NO : 43 0 GACTAAGCTTAAAATTAGATAGACGTTGAGT 
SP117A NO : 4 3 1 AGTCGGATCCCTGTGGCAATCAGTCAGCTGCTTCC 
SP117B NO; 432 GACTGTCGACTTTAATCTTGTCCCAGGTGGTTAATTTGCC 
SP1 1 8 A NO ; 4 3 3 ACTGGTCGACTTGTCAACAACAACATGCTACTTCTGAG 
SP118B NO : 4 3 4 GACTCTGCAGAAGTTTAACCCACTTATCATTATCC 
SP119A NO ; 43 5 ACTGGGATCCTTGTTCAGGCAAGTCCGTGACTAGTGAAC 
SP119B NO : 43 6 GACTAAGCTTGGCTAATTCCTTCAAAGTTTGCA 
SP120A NO : 4 3 7 AGTCGGATCCCTCGCAAATTGAAAAGGCGGCAGTTAGCC 
SP120B NO : 43 8 GACTAAGCTTGTAAATAAGCGTACCTTTTTCTTCC 
SP121A NO : 43 9 TCAGGGATCCTTGTCAGTCAGGTTCTAATGGTTCTCAG 
SP121B NO : 440 AGTCAAGCTTGGCATTGGCGTCGCCGTCCTTC 
SP122A NO : 441 GACTGGATCCGGAAACTTCACAGGATTTTAAAGAGAAG 
SP122B NO : 442 GACTGTCGACAATCAATCCTTCTTCTGCACTTCT 
SP123A NO : 443 CAGTGGATCCTGTGGTCGAAGTTGAGACTCCTCAATC 
SP123B NO : 444 GACTAAGCTTTTCTTCAAATTTATTATCAGC 
SP124A NO : 445 AGTCGGATCCAACACCTGTATATAAAGTTACAGCAATCG 
SP124B NO : 446 GACTGTCGACTACTTGACCG AATGCGTCGAATGTACG 


RE 

Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Sam HI 
Pst I 
Sam HI 
Hind III 
Sam HI 
Hind III 
Sal I 
Pst I 
Bam HI 
Hind III 
Sal I 
Hind III 
Sam HI 
Hind III 
Sam HI 
Hind III 
Sam HI 
Hind III 
Bam HI 
Hind III 
Sam HI 
Hind III 
Sal I 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Hind III 
Bam HI 
Sal I 
Sal I 
Pst 1 
Bam HI 
Hind III 
Bam HI 
Hind III 
Sam HI 
Hind III 
Bam HI . 
Sal I 
Bam HI 
Hind III 
Bam HI 
Sal I 
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s. 


Table 

pneumoniae ORF 


3 

Cloning 


primers 


primer 

Mane SEQ — ^ 

SP12SA NO: 447 
SP125B NO: 448 
SP126A NO: 449 
SP126B NO: 450 
SP127A NO : 4 5 i 
SP127B NO :452 


m.rsrir ATT AATTG AAATCGG 

CTGAGGATCCATTAGACAGATTAA 

gactgtcgactttaaagattgaagttttaaa^^ 


TGACGGATCCTAAGACAGA' 


CT^GCTTTMKCrrCCTCA^AGTTTGTCT 

gactggatccctgtgagaatcaa ta ggag 

CTGAAAGCTTTTGTAACTGAGATTGAT 


RE 

Sam HI 
Sal I 
Bam HI 
Hind HI 
Bam HI 
Hind III 


BNSDOCIO <WO 981 8930 A2_l_> 



wo 98/18930 


111 


PCT/US97/J9422 


indications RELATING 


TO A DEPOSITED MICROORGANISM 

(PCT Rule 13 bis) 



8NSOOC 'D <WO_98, B930A2j _ > 



PCT/US97/ 19422 


WO 98/18930 


112 


SINGAPORE microorganism shall only be 

puSion of the application. 


NORWAY , i^d open to public inspection 

Effected to an ***%£££ ,U at the time «■» such a truest >tas been 

5=SS , .*£S^~ < — 

case. 


AUSTRALIA mnle of a microorganisin 0 ^ 

... that the furnishing of a samp ^ withdrawal of the 
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What Is Claimed Is: 

1 ' M '*“"^5 “tolVrrXen- from 

having a nucleotide sequence at least 

^g^consi^of^^ encoding any of the atnino actd 

sequences of ^ ° f ^ "" 

sequences in (a). 

, An isolated nucleic actd n»l=cule contprising a 

which hybridizes unde, stnuge™ ’ W o, < W of 

having a nucleotide sequence fes does nol hybndize unde, 

claim 1 wherein said polynucleotid leoude having a nucleot.de 

sl , ingen, "les c/of only T tesidues. 

sequence consisting o y 

. , mnlecu ie comprising a polynucleotide 

3 , An isolated nucleic act Uope .bearing ponion of a 

which encodes the amino add “quern* (a) of Cairn 1. 
polypeptide having an amino acd sequence in (a) 

, wf rlaim 3. wherein said 

epitope-bearing portion of a poiyp v 
Table 2. 

5 a mediod fo, malting a recombinant vecto, comprising .nseriing 

an isolated nucleic acd molecule of chum 1 mto a vecto,. 


6 . A recombinant vector produced by the method of claim 5. 

7. 


7 a method of mahing a — - - — ** 

g . A recombinant host cel, produced by the method of claim 7. 

9 . A metl^ of producing a g "^ 

acid molecule of polypeptide- 
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10 . 


A polypeptide produced according ,o the 


method of claim 9. 


least 70% identical l^as^t^ce^iecKd f mPr * S h"^ ^ *** » 

- sequence of any of,* polypep, £££ -no 

^2. An isolated 


sequence of an S. pneumoniae 


polypeptide antigen comprising 


epitope shown in Table 2. 


an amino acid 




^ 9. 

data , I*' ^ ,S ° ,aBd *- ** specifically , Q , 


Of 


15. 


A Mtridotna which produces an antt^y accotding daim R 
^ 6. A vaccine, comprising: 

SroupiLtin^ Polypeptides selected f,„ m ^ 

“ orl g SZ:tl7 mmg “ ~ 

where, Z* ^ ^ « “*.*« 

member of the °' a ® nuatin S ™ —Cion caused by a 

said animal a polypeptide of claim „ ****** '° 

m .0 amount effective to pteven, or ^ * «*«-- 

sample obtained from an “ 3 bi0l ° 8ica ' 

sequences encoding S,rep,ococcus poly^ptides n a^ T " m ° rc ““ 

(a) contacting the samnlia SampJe Uprising: 

more Streptococcus nucleic acid sequences present in the^ bidto^cal^ampig” 6 ^ 
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19. A method of detecting Slreptococcus nucleic acids in a bio g 
sample " udeic ac,d " in sa,d 

2 0 A kit for detecting S,rep<ococcus antibodies in a biological 
(b) detecting means. 

2 , A method of detecting S,rep,ococcus antibodies in a bio, og, cal 
. . , . ;var.fi v-antieen complexes. 
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1. Claims: (1-21) partially 

An isolated nucleic acid j_ c ^Suence^at least 95% 

polynucleotide having a insisting of of: (a) 

identical to a sequence from the grou^.^ ^ the amino 
a nucleotide sequence S ^Q 1 n J.2 shown in Table 

acid sequence of commentary, to said 

i. or (b) a nucleotide sequence tu v nucleic a cid 
nucleotide sequence in (a); an ’L de w hich hybridizes 
Mlecule compr^ingapo^uclMUde^^^ having a 
under stringent conditions to a p °' J ucleot ide sequence in 
nucleotide sequence *° leo tide which hybridizes 

(al or (b), wherein said hybridization conditions 

does not hybridize under stringen y quence consisting 
?o a Synucleotide having. a nucleotide seq - c acld 

S only A or of only T residues; an iso » a J e encodes the amino 

^lecule comprising a polynucleotide wM a polypeptide 

5?id sequence or an epitope-bearing porx^^ ^ (a); sald 

having an amino acid sequenc polypeptide has an am ™ 

epitope-bearing portion of a: V «thcS of making a 
trid sequence listed in Table 2, said recombinant 

using said isolated ^S^a^ombinant host cell using 
rairvU^d^fSaat ^-^^£1.5*1 

producing caid poiypep'id^ a hyhridoma which 

antibody that binds to said po«yp v ising said 
produces said antibody; a vaccine comn u or a 

polypeptide selected from SEQ ID no.^^ ^ attenuatir? g an 
fragment therof; a method P cv re ptococcus genus in 

infectioncausedbya memberufStrept^ ^ ting 

above-described nucleic acdd p^be e^uj^^ u$)tig sa ,d 

Streptococcus antibodies m a-omi y 
polypeptide sequence; 

2-113. Claims: (1-21) partially 

-Idem as subject 1 but limited to ^e sequenceS Q av ^ 3 
10 nos. 3 to ZZednvention Zt^s^ I0 nQS 5 d 6 ; 

to SEQ 10 not. 225 end 22 )• 

+w»o first group is expiic^^y 

defined, ’the Sh^rou^Slfined » ^ 
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